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Preface 



The basic topic of this book is solving problems from system and control theory using 
convex optimization. We show that a wide variety of problems arising in system 
and control theory can be reduced to a handful of standard convex and quasiconvex 
optimization problems that involve matrix inequalities. For a few special cases there 
are "analytic solutions" to these problems, but our main point is that they can be 
solved numerically in all cases. These standard problems can be solved in polynomial- 
time (by, e.g., the ellipsoid algorithm of Shor, Nemirovskii, and Yudin), and so are 
tractable, at least in a theoretical sense. Recently developed interior-point methods 
for these standard problems have been found to be extremely efficient in practice. 
Therefore, we consider the original problems from system and control theory as solved. 

This book is primarily intended for the researcher in system and control theory, 
but can also serve as a source of application problems for researchers in convex op- 
timization. Although we believe that the methods described in this book have great 
practical value, we should warn the reader whose primary interest is applied control 
engineering. This is a research monograph: We present no specific examples or nu- 
merical results, and we make only brief comments about the implications of the results 
for practical control engineering. To put it in a more positive light, we hope that this 
book will later be considered as the first book on the topic, not the most readable or 
accessible. 

The background required of the reader is knowledge of basic system and control 
theory and an exposure to optimization. Sontag's book Mathematical Control The- 
ory [SON90] is an excellent survey. Further background material is covered in the 
texts Linear Systems [KAI80] by Kailath, Nonlinear Systems Analysis [Vid92] by 
Vidyasagar, Optimal Control: Linear Quadratic Methods [AM90] by Anderson and 
Moore, and Convex Analysis and Minimization Algorithms L [HUL93] by Hiriart- 
Urruty and Lemarechal. 

We also highly recommend the book Lnterior-point Polynomial Algorithms in Con- 
vex Programming [NN94] by Nesterov and Nemirovskii as a companion to this book. 
The reader will soon see that their ideas and methods play a critical role in the basic 
idea presented in this book. 
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Chapter 1 

Introduction 

1.1 Overview 

The aim of this book is to show that we can reduce a very wide variety of prob- 
lems arising in system and control theory to a few standard convex or quasiconvex 
optimization problems involving linear matrix inequalities (LMIs). Since these result- 
ing optimization problems can be solved numerically very efficiently using recently 
developed interior-point methods, our reduction constitutes a solution to the original 
problem, certainly in a practical sense, but also in several other senses as well. In com- 
parison, the more conventional approach is to seek an analytic or frequency-domain 
solution to the matrix inequalities. 

The types of problems we consider include: 

• matrix scaling problems, e.g., minimizing condition number by diagonal scaling 

• construction of quadratic Lyapunov functions for stability and performance anal- 
ysis of linear differential inclusions 

• joint synthesis of state-feedback and quadratic Lyapunov functions for linear 
differential inclusions 

• synthesis of state-feedback and quadratic Lyapunov functions for stochastic and 
delay systems 

• synthesis of Lur'e-type Lyapunov functions for nonlinear systems 

• optimization over an affine family of transfer matrices, including synthesis of 
multipliers for analysis of linear systems with unknown parameters 

• positive orthant stability and state-feedback synthesis 

• optimal system realization 

• interpolation problems, including scaling 

• multicriterion LQG/LQR 

• inverse problem of optimal control 

In some cases, we are describing known, published results; in others, we are extending 
known results. In many cases, however, it seems that the results are new. 

By scanning the list above or the table of contents, the reader will see that Lya- 
punov's methods will be our main focus. Here we have a secondary goal, beyond 
showing that many problems from Lyapunov theory can be cast as convex or quasi- 
convex problems. This is to show that Lyapunov's methods, which are traditionally 
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applied to the analysis of system stability, can just as well be used to find bounds on 
system performance, provided we do not insist on an "analytic solution" . 

1.2 A Brief History of LMIs in Control Theory 

The history of LMIs in the analysis of dynamical systems goes back more than 100 
years. The story begins in about 1890, when Lyapunov published his seminal work 
introducing what we now call Lyapunov theory. He showed that the differential equa- 
tion 

^ x (t) = Ax(t) (1.1) 

is stable (i.e., all trajectories converge to zero) if and only if there exists a positive- 
definite matrix P such that 

A T P + PA<0. (1.2) 

The requirement P > 0, A T P + PA < is what we now call a Lyapunov inequality on 
P, which is a special form of an LMI. Lyapunov also showed that this first LMI could 
be explicitly solved. Indeed, we can pick any Q = Q T > and then solve the linear 
equation A T P+PA = —Q for the matrix P, which is guaranteed to be positive-definite 
if the system (1.1) is stable. In summary, the first LMI used to analyze stability of a 
dynamical system was the Lyapunov inequality (1.2), which can be solved analytically 
(by solving a set of linear equations). 

The next major milestone occurs in the 1940's. Lur'e, Postnikov, and others in 
the Soviet Union applied Lyapunov's methods to some specific practical problems 
in control engineering, especially, the problem of stability of a control system with a 
nonlinearity in the actuator. Although they did not explicitly form matrix inequalities, 
their stability criteria have the form of LMIs. These inequalities were reduced to 
polynomial inequalities which were then checked "by hand" (for, needless to say, small 
systems). Nevertheless they were justifiably excited by the idea that Lyapunov's 
theory could be applied to important (and difficult) practical problems in control 
engineering. From the introduction of Lur'e's 1951 book [Lur57] we find: 

This book represents the first attempt to demonstrate that the ideas ex- 
pressed 60 years ago by Lyapunov, which even comparatively recently ap- 
peared to be remote from practical application, are now about to become a 
real medium for the examination of the urgent problems of contemporary 
engineering. 

In summary, Lur'e and others were the first to apply Lyapunov's methods to practical 
control engineering problems. The LMIs that resulted were solved analytically, by 
hand. Of course this limited their application to small (second, third order) systems. 
The next major breakthrough came in the early 1960's, when Yakubovich, Popov, 
Kalman, and other researchers succeeded in reducing the solution of the LMIs that 
arose in the problem of Lur'e to simple graphical criteria, using what we now call 
the positive-real (PR) lemma (see §2.7.2). This resulted in the celebrated Popov 
criterion, circle criterion, Tsypkin criterion, and many variations. These criteria could 
be applied to higher order systems, but did not gracefully or usefully extend to systems 
containing more than one nonlinearity. From the point of view of our story (LMIs in 
control theory) , the contribution was to show how to solve a certain family of LMIs 
by graphical methods. 

Copyright © 1994 by the Society for Industrial and Applied Mathematics. 



1.2 A Brief History of LMIs in Control Theory 3 

The important role of LMIs in control theory was already recognized in the early 
1960's, especially by Yakubovich [Yak62, Yak64, Yak67]. This is clear simply from 
the titles of some of his papers from 1962-5, e.g., The solution of certain matrix 
inequalities in automatic control theory (1962), and The method of matrix inequalities 
in the stability theory of nonlinear control systems (1965; English translation 1967). 

The PR lemma and extensions were intensively studied in the latter half of the 
1960s, and were found to be related to the ideas of passivity, the small-gain criteria 
introduced by Zames and Sandberg, and quadratic optimal control. By 1970, it was 
known that the LMI appearing in the PR lemma could be solved not only by graphical 
means, but also by solving a certain algebraic Riccati equation (ARE). In a 1971 
paper [Wil71b] on quadratic optimal control, J. C. Willems is led to the LMI 



A T P +PA+Q PB + C T 
B T P + C R 



>0, 



(1.3) 



and points out that it can be solved by studying the symmetric solutions of the ARE 

A T P +PA- {PB + C T )R- 1 {R T P + C) + Q = 0, 

which in turn can be found by an eigendecomposition of a related Hamiltonian matrix. 
(See §2.7.2 for details.) This connection had been observed earlier in the Soviet Union, 
where the ARE was called the Lur'e resolving equation (see [Yak88]). 

So by 1971, researchers knew several methods for solving special types of LMIs: 
direct (for small systems), graphical methods, and by solving Lyapunov or Riccati 
equations. From our point of view, these methods are all "closed-form" or "analytic" 
solutions that can be used to solve special forms of LMIs. (Most control researchers 
and engineers consider the Riccati equation to have an "analytic" solution, since the 
standard algorithms that solve it are very predictable in terms of the effort required, 
which depends almost entirely on the problem size and not the particular problem 
data. Of course it cannot be solved exactly in a finite number of arithmetic steps for 
systems of fifth and higher order.) 

In Willems' 1971 paper we find the following striking quote: 

The basic importance of the LMI seems to be largely unappreciated. It 
would be interesting to see whether or not it can be exploited in compu- 
tational algorithms, for example. 

Here Willems refers to the specific LMI (1.3), and not the more general form that 
we consider in this book. Still, Willems' suggestion that LMIs might have some 
advantages in computational algorithms (as compared to the corresponding Riccati 
equations) foreshadows the next chapter in the story. 

The next major advance (in our view) was the simple observation that: 

The LMIs that arise in system and control theory can be formulated as 
convex optimization problems that are amenable to computer solution. 

Although this is a simple observation, it has some important consequences, the most 
important of which is that we can reliably solve many LMIs for which no "analytic 
solution" has been found (or is likely to be found). 

This observation was made explicitly by several researchers. Pyatnitskii and Sko- 
rodinskii [PS82] were perhaps the first researchers to make this point, clearly and 
completely. They reduced the original problem of Lur'e (extended to the case of mul- 
tiple nonlinearities) to a convex optimization problem involving LMIs, which they 
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4 Chapter 1 Introduction 

then solved using the ellipsoid algorithm. (This problem had been studied before, but 
the "solutions" involved an arbitrary scaling matrix.) Pyatnitskii and Skorodinskii 
were the first, as far as we know, to formulate the search for a Lyapunov function as 
a convex optimization problem, and then apply an algorithm guaranteed to solve the 
optimization problem. 

We should also mention several precursors. In a 1976 paper, Horisberger and Be- 
langer [HB76] had remarked that the existence of a quadratic Lyapunov function that 
simultaneously proves stability of a collection of linear systems is a convex problem 
involving LMIs. And of course, the idea of having a computer search for a Lya- 
punov function was not new — it appears, for example, in a 1965 paper by Schultz et 
al. [SSHJ65]. 

The final chapter in our story is quite recent and of great practical importance: the 
development of powerful and efficient interior-point methods to solve the LMIs that 
arise in system and control theory. In 1984, N. Karmarkar introduced a new linear 
programming algorithm that solves linear programs in polynomial-time, like the ellip- 
soid method, but in contrast to the ellipsoid method, is also very efficient in practice. 
Karmarkar's work spurred an enormous amount of work in the area of interior-point 
methods for linear programming (including the rediscovery of efficient methods that 
were developed in the 1960s but ignored). Essentially all of this research activity con- 
centrated on algorithms for linear and (convex) quadratic programs. Then in 1988, 
Nesterov and Nemirovskii developed interior-point methods that apply directly to con- 
vex problems involving LMIs, and in particular, to the problems we encounter in this 
book. Although there remains much to be done in this area, several interior-point 
algorithms for LMI problems have been implemented and tested on specific families 
of LMIs that arise in control theory, and found to be extremely efficient. 

A summary of key events in the history of LMIs in control theory is then: 

• 1890: First LMI appears; analytic solution of the Lyapunov LMI via Lyapunov 
equation. 

• 1940's: Application of Lyapunov's methods to real control engineering prob- 
lems. Small LMIs solved "by hand" . 

• Early 1960's: PR lemma gives graphical techniques for solving another family 
of LMIs. 

• Late 1960's: Observation that the same family of LMIs can be solved by solving 
an ARE. 

• Early 1980's: Recognition that many LMIs can be solved by computer via 
convex programming. 

• Late 1980's: Development of interior-point algorithms for LMIs. 

It is fair to say that Yakubovich is the father of the field, and Lyapunov the grandfather 
of the field. The new development is the ability to directly solve (general) LMIs. 

1.3 Notes on the Style of the Book 

We use a very informal mathematical style, e.g., we often fail to mention regularity or 
other technical conditions. Every statement is to be interpreted as being true modulo 
appropriate technical conditions (that in most cases are trivial to figure out). 

We are very informal, perhaps even cavalier, in our reduction of a problem to 
an optimization problem. We sometimes skip "details" that would be important if 
the optimization problem were to be solved numerically. As an example, it may be 

Copyright © 1994 by the Society for Industrial and Applied Mathematics. 



1.4 Origin of the Book 5 

necessary to add constraints to the optimization problem for normalization or to ensure 
boundedness. We do not discuss initial guesses for the optimization problems, even 
though good ones may be available. Therefore, the reader who wishes to implement 
an algorithm that solves a problem considered in this book should be prepared to 
make a few modifications or additions to our description of the "solution" . 

In a similar way, we do not pursue any theoretical aspects of reducing a problem 
to a convex problem involving matrix inequalities. For example, for each reduced 
problem we could state, probably simplify, and then interpret in system or control 
theoretic terms the optimality conditions for the resulting convex problem. Another 
fascinating topic that could be explored is the relation between system and control 
theory duality and convex programming duality. Once we reduce a problem arising 
in control theory to a convex program, we can consider various dual optimization 
problems, lower bounds for the problem, and so on. Presumably these dual problems 
and lower bounds can be given interesting system-theoretic interpretations. 

We mostly consider continuous-time systems, and assume that the reader can 
translate the results from the continuous-time case to the discrete-time case. We 
switch to discrete-time systems when we consider system realization problems (which 
almost always arise in this form) and also when we consider stochastic systems (to 
avoid the technical details of stochastic differential equations) . 

The list of problems that we consider is meant only to be representative, and 
certainly not exhaustive. To avoid excessive repetition, our treatment of problems 
becomes more terse as the book progresses. In the first chapter on analysis of lin- 
ear differential inclusions, we describe many variations on problems (e.g., computing 
bounds on margins and decay rates); in later chapters, we describe fewer and fewer 
variations, assuming that the reader could work out the extensions. 

Each chapter concludes with a section entitled Notes and References, in which we 
hide proofs, precise statements, elaborations, and bibliography and historical notes. 
The completeness of the bibliography should not be overestimated, despite its size 
(over 500 entries). The appendix contains a list of notation and a list of acronyms 
used in the book. We apologize to the reader for the seven new acronyms we introduce. 

To lighten the notation, we use the standard convention of dropping the time 
argument from the variables in differential equations. Thus, x = Ax is our short 
form for dx/dt = Ax(t). Here A is a constant matrix; when we encounter time- 
varying coefficients, we will explicitly show the time dependence, as in x = A(t)x. 
Similarly, we drop the dummy variable from definite integrals, writing for example, 
Jo u T ydt for L u(t) T y(t) dt. To reduce the number of parentheses required, we adopt 
the convention that the operators Tr (trace of a matrix) and E (expected value) have 
lower precedence than multiplication, transpose, etc. Thus, Tr A T B means Tr (A T £>) . 

1.4 Origin of the Book 

This book started out as a section of the paper Method of Centers for Minimizing 
Generalized Eigenvalues, by Boyd and El Ghaoui [BE93], but grew too large to be 
a section. For a few months it was a manuscript (that presumably would have been 
submitted for publication as a paper) entitled Generalized Eigenvalue Problems Aris- 
ing in Control Theory. Then Feron, and later Balakrishnan, started adding material, 
and soon it was clear that we were writing a book, not a paper. The order of the 
authors' names reflects this history. 
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Chapter 2 

Some Standard Problems 
Involving LMIs 

2.1 Linear Matrix Inequalities 

A linear matrix inequality (LMI) has the form 

m 

F{x)=F +J2 x i F i>°> ( 2J ) 

i=\ 

where x € R m is the variable and the symmetric matrices Fi = Ff G R nx ™, i = 
0,...,m, are given. The inequality symbol in (2.1) means that F(x) is positive- 
definite, i.e., u T F(x)u > for all nonzero u € R". Of course, the LMI (2.1) is 
equivalent to a set of n polynomial inequalities in x, i.e., the leading principal minors 
of F(x) must be positive. 

We will also encounter nonstrict LMIs, which have the form 

F(x) > 0. (2.2) 

The strict LMI (2.1) and the nonstrict LMI (2.2) are closely related, but a precise 
statement of the relation is a bit involved, so we defer it to §2.5. In the next few 
sections we consider strict LMIs. 

The LMI (2.1) is a convex constraint on x, i.e., the set {x \ F(x) > 0} is convex. 
Although the LMI (2.1) may seem to have a specialized form, it can represent a 
wide variety of convex constraints on x. In particular, linear inequalities, (convex) 
quadratic inequalities, matrix norm inequalities, and constraints that arise in control 
theory, such as Lyapunov and convex quadratic matrix inequalities, can all be cast in 
the form of an LMI. 

Multiple LMIs F^(x) > 0, . . . , F (p )(x) > can be expressed as the single LMI 
diag(F < - 1 '(x), . . . ,F( p '(x)) > 0. Therefore we will make no distinction between a set 
of LMIs and a single LMI, i.e., "the LMI F^\x) > 0, . . . , F^(x) > 0" will mean "the 
LMIdiag(F^(x),...,F^\x)) > 0". 

When the matrices Fi are diagonal, the LMI F(x) > is just a set of linear 
inequalities. Nonlinear (convex) inequalities are converted to LMI form using Schur 
complements. The basic idea is as follows: the LMI 

Q(x) S(x) 1 , x 

' >0, 2.3 

S(x) T R(x) 
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where Q(x) = Q(x) T , R(x) = R(x) T , and S(x) depend affinely on x, is equivalent to 

R(x)>0, Q(x) - S{x)R(x)- 1 S{x) T > 0. (2.4) 

In other words, the set of nonlinear inequalities (2.4) can be represented as the 
LMI (2.3). 

As an example, the (maximum singular value) matrix norm constraint ||Z(x)|| < 1, 
where Z(x) G H pxq and depends affinely on x, is represented as the LMI 



/ Z{x) 

Z{x) T I 



>0 



(since \\Z\\ < 1 is equivalent to I — ZZ T > 0). Note that the case q = 1 reduces to a 
general convex quadratic inequality on x. 

The constraint c(x) P(x)~ 1 c(x) < 1, P(x) > 0, where c(x) G R™ and P(x) = 
P(x) T G R, nx ™ depend affinely on x, is expressed as the LMI 



P(x) c(x) 
c{x) T 1 



>0. 



More generally, the constraint 

TrSixfPix^Six) < 1, P{x) > 0, 

where P(x) = P(x) T G R nx " and S(x) G R nxp depend affinely on x, is handled by 
introducing a new (slack) matrix variable X = X T G R pxp , and the LMI (in x and 
X): 



TrX< 1, 



X S(x) T 
S{x) P{x) 



>0. 



Many other convex constraints on x can be expressed in the form of an LMI; see the 
Notes and References. 



2.1.1 Matrices as variables 

We will often encounter problems in which the variables are matrices, e.g., the Lya- 
punov inequality 

A T P+ PA<0, (2.5) 

where A G R" x " is given and P = P T is the variable. In this case we will not write out 
the LMI explicitly in the form F(x) > 0, but instead make clear which matrices are 
the variables. The phrase "the LMI A T P + PA < in P" means that the matrix P is 
a variable. (Of course, the Lyapunov inequality (2.5) is readily put in the form (2.1), 
as follows. Let Pi, ... , P m be a basis for symmetric n x n matrices (m = n{n + l)/2). 
Then take Po = and P, = —A Pi — PjA) Leaving LMIs in a condensed form 
such as (2.5), in addition to saving notation, may lead to more efficient computation; 
see §2.4.4. 

As another related example, consider the quadratic matrix inequality 

A T P+ PA+ PBR- 1 B T P + Q <0, (2.6) 

Copyright © 1994 by the Society for Industrial and Applied Mathematics. 



2.2 Some Standard Problems 9 

where A, B, Q = Q T , R = R T > are given matrices of appropriate sizes, and 
P = P T is the variable. Note that this is a quadratic matrix inequality in the variable 
P. It can be expressed as the linear matrix inequality 



-A T P - PA-Q PB 
B T P R 



>0. 



This representation also clearly shows that the quadratic matrix inequality (2.6) is 
convex in P, which is not obvious. 



2.1.2 Linear equality constraints 

In some problems we will encounter linear equality constraints on the variables, e.g. 

P > 0, A T P+ PA<0, TrP=l, (2.7) 

where P G R x is the variable. Of course we can eliminate the equality constraint to 
write (2.7) in the form F(x) > 0. Let Pi, ... , P m be a basis for symmetric kxk matrices 
with trace zero (in = (k(k + l)/2) — 1) and let Pq be a symmetric kxk matrix with 
Tr P = 1. Then take F = diag(P , -A T P - P A) and F { = diag(P i; -A T P % - P % A) 
for i = 1, . . . , m. 

We will refer to constraints such as (2.7) as LMIs, leaving any required elimination 
of equality constraints to the reader. 

2.2 Some Standard Problems 

Here we list some common convex and quasiconvex problems that we will encounter 
in the sequel. 

2.2.1 LMI problems 

Given an LMI F(x) > 0, the corresponding LMI Problem (LMIP) is to find x eas such 
that F(x lcas ) > or determine that the LMI is infeasible. (By duality, this means: 
Find a nonzero G > such that Tr GFi = for i = 1 , . . . , m and Tr GFq < 0; see the 
Notes and References.) Of course, this is a convex feasibility problem. We will say 
"solving the LMI F(x) > 0" to mean solving the corresponding LMIP. 

As an example of an LMIP, consider the "simultaneous Lyapunov stability prob- 
lem" (which we will see in §5.1): We are given Ai G R nxn , i = 1, . . . , L, and need to 
find P satisfying the LMI 

P>0, AjP+PAi<0, i = l,...,L, 

or determine that no such P exists. Determining that no such P exists is equivalent 
to finding Qo, ■ . . , Ql, not all zero, such that 

L 
Qo>0,...,Q L >0, Qo = '%2(QiA[ + A i Q i ), (2.8) 

i=l 

which is another (nonstrict) LMIP. 
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10 Chapter 2 Some Standard Problems Involving LMIs 

2.2.2 Eigenvalue problems 

The eigenvalue problem (EVP) is to minimize the maximum eigenvalue of a matrix 
that depends affinely on a variable, subject to an LMI constraint (or determine that 
the constraint is infeasible), i.e., 

minimize A 

subject to XI - A(x) > 0, B(x) > 

where A and B are symmetric matrices that depend affinely on the optimization 
variable x. This is a convex optimization problem. 

EVPs can appear in the equivalent form of minimizing a linear function subject 
to an LMI, i.e., 

minimize c T x 

(2.9) 
subject to F(x) > 

with F an affine function of x. In the special case when the matrices Fi are all diagonal, 
this problem reduces to the general linear programming problem: minimizing the 
linear function c T x subject to a set of linear inequalities on x. 
Another equivalent form for the EVP is: 

minimize A 

subject to A(x, A) > 

where A is affine in (a;, A). We leave it to the reader to verify that these forms are 
equivalent, i.e., any can be transformed into any other. 

As an example of an EVP, consider the problem (which appears in §6.3.2): 



minimize 7 

-A T P -PA- C T C PB 

B T P 7/ 



subject to 



>0, P>0 



where the matrices A £ K nxn , B £ K nxp , and C £ R mx " are given, and P and 7 are 
the optimization variables. From our remarks above, this EVP can also be expressed 
in terms of the associated quadratic matrix inequality: 

minimize 7 

subject to A T P + PA + C T C + j~ 1 PBB T P<0, P>0 

2.2.3 Generalized eigenvalue problems 

The generalized eigenvalue problem (GEVP) is to minimize the maximum generalized 
eigenvalue of a pair of matrices that depend affinely on a variable, subject to an LMI 
constraint. The general form of a GEVP is: 

minimize A 

subject to XB{x) - A{x) > 0, B{x) > 0, C(x) > 

where A, B and C are symmetric matrices that are affine functions of x. We can 
express this as 

minimize A max (^4(x), B{x)) 
subject to B(x) > 0, C(x) > 
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2.2 Some Standard Problems 11 

where A max (V, Y) denotes the largest generalized eigenvalue of the pencil XY — X 
with Y > 0, i.e., the largest eigenvalue of the matrix Y~ 1 / 2 XY~ 1 / 2 . This GEVP is 
a quasiconvex optimization problem since the constraint is convex and the objective, 
X max (A(x), B(x)), is quasiconvex. This means that for feasible x, x and < 9 < 1, 

A max (A(0cc + (1 - 9)x),B{9x + (1 - 9)x)) < 

m.ax{X max (A(x),B(x)),X max (A(x),B(x))} . 

Note that when the matrices are all diagonal and A(x) and B(x) are scalar, this 
problem reduces to the general linear- fractional programming problem, i.e., minimiz- 
ing a linear-fractional function subject to a set of linear inequalities. In addition, 
many nonlinear quasiconvex functions can be represented in the form of a GEVP with 
appropriate A, B, and C; see the Notes and References. 

An equivalent alternate form for a GEVP is 

minimize A 

subject to A(x, A) > 

where A(x, A) is affine in x for fixed A and affine in A for fixed x, and satisfies the 
monotonicity condition X > fi => A (x, A) > A{x, fj,). As an example of a GEVP, 
consider the problem 

maximize a 

subject to -A T P -PA- 2aP > 0, P > 

where the matrix A is given, and the optimization variables are the symmetric matrix 
P and the scalar a. (This problem arises in §5.1.) 

2.2.4 A convex problem 

Although we will be concerned mostly with LMIPs, EVPs, and GEVPs, we will also 
encounter the following convex problem, which we will abbreviate CP: 

minimize logdet A(x)^ 1 

6 V ; (2.10) 

subject to A(x) > 0, B(x) > 

where A and B are symmetric matrices that depend affinely on x. (Note that when 
A > 0, logdet A^ 1 is a convex function of A.) 

Remark: Problem CP can be transformed into an EVP, since det A(x) > A can 
be represented as an LMI in x and A; see [NN94, §6.4.3]. As we do with variables 
which are matrices, we leave these problems in the more natural form. 

As an example of CP, consider the problem: 

minimize logdet P^ 1 

(2.11) 
subject to P > 0, vfPvi <1, i = 1, . . . ,L 

here Vi G R™ are given and P = P T 6 j^ nx ™ j s ^j^g variable. 

We will encounter several variations of this problem, which has the following 
interpretation. Let £ denote the ellipsoid centered at the origin determined by P, 
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12 Chapter 2 Some Standard Problems Involving LMIs 

£ = \z \ z T Pz < 1 }. The constraints are simply vi € £. Since the volume of £ is pro- 
portional to (det P) -1 ' 2 , minimizing logdet P _1 is the same as minimizing the volume 
of £. So by solving (2.11), we find the minimum volume ellipsoid, centered at the ori- 
gin, that contains the points v±, . . . , Vl, or equivalently, the polytope Co{t>i, . . . , vl}, 
where Co denotes the convex hull. 

2.2.5 Solving these problems 

The standard problems (LMIPs, EVPs, GEVPs, and CPs) are tractable, from both 
theoretical and practical viewpoints: 

• They can be solved in polynomial-time (indeed with a variety of interpretations 
for the term "polynomial-time"). 

• They can be solved in practice very efficiently. 

By "solve the problem" we mean: Determine whether or not the problem is fea- 
sible, and if it is, compute a feasible point with an objective value that exceeds the 
global minimum by less than some prespecified accuracy. 

2.3 Ellipsoid Algorithm 

We first describe a simple ellipsoid algorithm that, roughly speaking, is guaranteed 
to solve the standard problems. We describe it here because it is very simple and, 
from a theoretical point of view, efficient (polynomial-time). In practice, however, the 
interior-point algorithms described in the next section are much more efficient. 

Although more sophisticated versions of the ellipsoid algorithm can detect in- 
feasible constraints, we will assume that the problem we are solving has at least one 
optimal point, i.e., the constraints are feasible. (In the feasibility problem, we consider 
any feasible point as being optimal.) The basic idea of the algorithm is as follows. We 
start with an ellipsoid £' ' that is guaranteed to contain an optimal point. We then 
compute a cutting plane for our problem that passes through the center x^> of £^°> . 
This means that we find a nonzero vector g(°> such that an optimal point lies in the 
half-space {z | g^ T {z - x (0) ) < 0} (or in the half-space {z | g^ T (z - x'- "') < 0}, 
depending on the situation) . (We will explain how to do this for each of our problems 
later.) We then know that the sliced half-ellipsoid 



£^n{z \g^ T (z-x^)<()} 



contains an optimal point. Now we compute the ellipsoid £^> of minimum volume 
that contains this sliced half-ellipsoid; £ < - 1 > is then guaranteed to contain an optimal 
point. The process is then repeated. 

We now describe the algorithm more explicitly. An ellipsoid £ can be described 

as 

£= {z | {z-a) T A- 1 {z-a) < l} 
where A = A T > 0. The minimum volume ellipsoid that contains the half-ellipsoid 

{z | {z-a) T A- 1 {z-a) < 1, 5 T (^-a)<0} 
is given by 

£=lz {z-a) T A~ 1 {z-a) < l|, 
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2.3 Ellipsoid Algorithm 13 

where 

Ag - m 2 ( A 2 T 



A Agg 1 A 



m + 1 ' m 2 — 1 \ m + 1 

and g = g/ \/g T Ag. (We note that these formulas hold only for m > 2. In the case of 
one variable, the minimum length interval containing a half-interval is the half-interval 
itself; the ellipsoid algorithm, in this case, reduces to the familiar bisection algorithm.) 
The ellipsoid algorithm is initialized with x^ ' and A^ ' such that the corresponding 
ellipsoid contains an optimal point. The algorithm then proceeds as follows: for 
fc=l,2,... 

compute a g^ k > that defines a cutting plane at x^ k ' 

~g:={g(k)T A {k) g {k)y^ 2 g {k) 

x (k+i) ._ (fc) _ -1-A^g 

m-\-l - y 

A (k+1) ^_ ( A (k) _ _^ A (k)--T A (k)\ 

This recursion generates a sequence of ellipsoids that are guaranteed to contain an 
optimal point. It turns out that the volume of these ellipsoids decreases geometrically. 
We have 

vol(£W) <e-^vol(£(°>), 

and this fact can be used to prove polynomial-time convergence of the algorithm. (We 
refer the reader to the papers cited in the Notes and References for precise statements 
of what we mean by "polynomial-time" and indeed, "convergence," as well as proofs.) 

We now show how to compute cutting planes for each of our standard problems. 
LMIPs: Consider the LMI 

m 

F{x) = F +J2 x i F i>°- 

i=l 

If x does not satisfy this LMI, there exists a nonzero u such that 
u T F(x)u = u T I F + ^ XiF, J u < 0. 

Define g by gi = —u T FiU, i = 1, . . . , m. Then for any z satisfying g T (z — x) > we 
have 

u T F(z)u = u T I F + Y^ z i F i ) u = u T F(x)u - g T (z - x) < 0. 

It follows that every feasible point lies in the half-space {z \ g T (z — x) < 0}, i.e., this 
g defines a cutting plane for the LMIP at the point x. 

EVPs: Consider the EVP 

minimize c T x 
subject to F(x) > 
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14 Chapter 2 Some Standard Problems Involving LMIs 

Suppose first that the given point x is infeasible, i.e., F{x) ^ 0. Then we can con- 
struct a cutting plane for this problem using the method described above for LMIPs. 
In this case, we are discarding the half-space {z | g T (z — x) > 0} because all such 
points are infeasible. 

Now assume that the given point x is feasible, i.e., F(x) > 0. In this case, the 
vector g = c defines a cutting plane for the EVP at the point x. Here, we are discarding 
the half-space {z \ g T (z — x) > 0} because all such points (whether feasible or not) 
have an objective value larger than x, and hence cannot be optimal. 

GEVPs: We consider the formulation 

minimize A max (^4(x), B{x)) 

subject to B(x) > 0, C(x) > 

Here, A(x) = A + Y,7Li XiA u B(x) = B Q + YT=i x i B i and C( x ) = Co + YT=i x i^i- 
Now suppose we are given a point x. If the constraints are violated, we use the method 
described for LMIPs to generate a cutting plane at x. 

Suppose now that x is feasible. Pick any m/0 such that 

(X max (A(x),B(x))B(x) - A{x)) u = 0. 

Define g by 

g% = -u T {\ mayL {A{x) 1 B{x))B i - AAu, i = l,...,m. 

We claim g defines a cutting plane for this GEVP at the point x. To see this, we note 
that for any z, 

u T (X max (A{x),B{x))B(z) - A(z)) u = -g T {z - x). 
Hence, if g T (z — x) > we find that 

\ max (A(z),B(z)) > \ max (A(x),B(x)) 

which establishes our claim. 

CPs: We now consider our standard CP (2.10). When the given point x is infeasible 
we already know how to generate a cutting plane. So we assume that x is feasible. In 
this case, a cutting plane is given by the gradient of the objective 

— logdetvl(x) = 

at x, i.e., gi = — Tr AiA(x)^ 1 . Since the objective function is convex we have for all 

z, 

logdet A{zy x > logdet^(a;) _1 + g T \z - x). 

In particular, g T (z — x) > implies logdet A(z)~ 1 > logdet ^(a;)^ 1 , and hence z 
cannot be optimal. 

2.4 Interior-Point Methods 

Since 1988, interior-point methods have been developed for the standard problems. 
In this section we describe a simple interior-point method for solving an EVP, given 
a feasible starting point. The references describe more sophisticated interior-point 
methods for the other problems (including the problem of computing a feasible starting 
point for an EVP). 
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2.4 Interior-Point Methods 15 

2.4.1 Analytic center of an LMI 

The notion of the analytic center of an LMI plays an important role in interior-point 
methods, and is important in its own right. Consider the LMI 

m 

F(x) = F +J2xiFi>0, 
i=i 

where F t = Ff £ R" XTl . The function 

A J logdetF^)- 1 ^(x)>0 
4>{x) = < (2.12) 

I oo otherwise, 

is finite if and only if F(x) > 0, and becomes infinite as x approaches the boundary 
of the feasible set {x\F(x) > 0}, i.e., it is a barrier function for the feasible set. (We 
have already encountered this function in our standard problem CP.) 

We suppose now that the feasible set is nonempty and bounded, which implies 
that the matrices F\, . . . , F m are linearly independent (otherwise the feasible set will 
contain a line). It can be shown that (f> is strictly convex on the feasible set, so it has 
a unique minimizer, which we denote x*: 

x* = arg min 4>{x). (2.13) 



We refer to x* as the analytic center of the LMI F{x) > 0. Equivalently, 

x* = arg max detF(x), (2-14) 

F(x) > 

that is, F(x*) has maximum determinant among all positive-definite matrices of the 
form F(x). 

Remark: The analytic center depends on the LMI (i.e., the data Fq, . . . ,F m ) 
and not its feasible set: We can have two LMIs with different analytic centers but 
identical feasible sets. The analytic center is, however, invariant under congruence 
of the LMI: F(x) > and T T F(x)T > have the same analytic center provided 
T is nonsingular. 

We now turn to the problem of computing the analytic center of an LMI. (This 
is a special form of our problem CP.) Newton's method, with appropriate step length 
selection, can be used to efficiently compute x* , starting from a feasible initial point. 
We consider the algorithm: 

x (k+i) ._ x (k) _ a (*) ff(x (*))-i s ( it (*)) ] (2.15) 

where aS k > is the damping factor of the fcth iteration, and g{x^ ') and H(x^ k ') denote 
the gradient and Hessian of <fi, respectively, at x^ '. In [NN94, §2.2], Nesterov and 
Nemirovskii give a simple step length rule appropriate for a general class of barrier 
functions (called s elf- concordant) , and in particular, the function cf>. Their damping 
factor is: 

a(«J ! m if ^)< 1/4, 

[ 1/{1 + S{x^)) otherwise, 
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16 Chapter 2 Some Standard Problems Involving LMIs 

where 

S(x) = yJg(x) T H(x)-ig(x) 

is called the Newton decrement of 4> at x. Nesterov and Nemirovskii show that this 
step length always results in x^ k+ ^ feasible, i.e., F(x^ k+1 ') > 0, and convergence of 

x^ to X*. 

Indeed, they give sharp bounds on the number of steps required to compute x* 
to a given accuracy using Newton's method with the step length (2.16). They prove 
that 4>(x( k ') — 4>(x*) < e whenever 

k > ci + c 2 loglog(l/e) + c 3 (<t>{x {0) ) - 4>{x*)) , (2-17) 

where Ci, C2, and C3 are three absolute constants, i.e., specific numbers. The first 
and second terms on the right-hand side do not depend on any problem data, i.e., 
the matrices Fq, . . . , F m , and the numbers m and n. The second term grows so slowly 
with required accuracy e that for all practical purposes it can be lumped together with 
the first and considered an absolute constant. The last term on the right-hand side 
of (2.17) depends on how "centered" the initial point is. 

2.4.2 The path of centers 

Now consider the standard EVP: 
minimize c T x 
subject to F{x) > 

Let A opt denote its optimal value, so for each A > A opt the LMI 

F(x) > 0, c T x < A (2.18) 

is feasible. We will also assume that the LMI (2.18) has a bounded feasible set, and 
therefore has an analytic center which we denote a;*(A): 

x*(X) = arg min I logdet F(x)~ + log 

\ A — c 

x 

The curve given by x*(\) for A > A opt is called the path of centers for the EVP. 
It can be shown that it is analytic and has a limit as A — > A opt , which we denote x° pt . 
The point x opt is optimal (or more precisely, the limit of a minimizing sequence) since 
for A > A opt , x*(X) is feasible and satisfies c T x*(X) < A. 

The point x*(X) is characterized by 

_d_ 

dx 



logdet F(x) 1 + log 



x*(X) V A c x / (2.19) 

TrF(x*(\))- 1 F i +- J— — =0, i = l,...,m. 



X-c T x*{X) 



2.4.3 Method of centers 



The method of centers is a simple interior-point algorithm that solves an EVP, given a 
feasible starting point. The algorithm is initialized with A*- ) and x^ ' , with F(x^ ') > 
and c T x^> < X^ ' , and proceeds as follows: 

A (fc+1) . = {1 _ 6)c T x (k) + eX (k) 

x( fe +D := x*(A( fc+1 )) 
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2.4 Interior-Point Methods 17 

where 9 is a parameter with < 9 < 1. 

The classic method of centers is obtained with 9 = 0. In this case, however, x^ ' 
does not (quite) satisfy the new inequality c T x < \( k+1 > . With 9 > 0, however, the 
current iterate x^ ' is feasible for the inequality c T x < \( k+1 > , F(x) > 0, and therefore 
can be used as the initial point for computing the next iterate cc*(A^ +1 ') via Newton's 
method. Since computing an analytic center is itself a special type of CP, we can view 
the method of centers as a way of solving an EVP by solving a sequence of CPs (which 
is done using Newton's method). 

We now give a simple proof of convergence. Multiplying (2.19) by {x\ — x° p ) 
and summing over i yields: 

TrF(x^)- 1 (F(x {k) ) ~ F(* opt )) = m ^ c T x(k) cT ^ (k) ~ x ° Pt )- 
Since Tr F{x ( - k y)- 1 F{x°P t ) > 0, we conclude that 

Replacing c T X^ by (A< fe+1 ) - 9\^)/{l - 9) yields 
(A (fe+i) _ A op t) < !L+0 (A (fc) _ A o Pt)j 

which proves that X- k > approaches A opt with at least geometric convergence. Note that 
we can also express the inequality above in the form 

c T x (k) _ A opt < n(A (fc) _ c T x (k)^ 

which shows that the stopping criterion 

until (a« - c T x^ < e/nj 

guarantees that on exit, the optimal value has been found within e. 

We make a few comments here, and refer the reader to the Notes and References for 
further elaboration. First, this variation on the method of centers is not polynomial- 
time, but more sophisticated versions are. Second, and perhaps more important, we 
note that two simple modifications of the method of centers as described above yield 
an algorithm that is fairly efficient in practice. The modifications are: 

• Instead of the Nesterov-Nemirovskii step length, a step length chosen to mini- 
mize 4> along the Newton search direction (i.e., an exact line-search) will yield 
faster convergence to the analytic center. 

• Instead of defining x*(A) to be the analytic center of the LMI F(x) > 0, c T x < A, 
we define it to be the analytic center of the LMI F(x) > along with q copies 
of c T x < A, where q > 1 is an integer. In other words we use 

x*(\) = arg min logdet F(x)~ + qlo 



A — c T x 
x 

Using q > 1, say q k, n where F(x) G R™ x ", yields much faster reduction of A 
per iteration. 

Among interior-point methods for the standard problems, the method of centers 
is not the most efficient. The most efficient algorithms developed so far appear to be 
primal-dual algorithms (and variations) and the projective method of Nemirovskii; see 
the Notes and References. 
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18 Chapter 2 Some Standard Problems Involving LMIs 

2.4.4 Interior-point methods and problem structure 

An important feature of interior-point methods is that problem structure can be ex- 
ploited to increase efficiency. The idea is very roughly as follows. In interior-point 
methods most of the computational effort is devoted to computing the Newton direc- 
tion of a barrier or similar function. It turns out that this Newton direction can be 
expressed as the solution of a weighted least-squares problem of the same size as the 
original problem. Using conjugate- gradient and other related methods to solve these 
least-squares systems gives two advantages. First, by exploiting problem structure in 
the conjugate-gradient iterations, the computational effort required to solve the least- 
squares problems is much smaller than by standard "direct" methods such as QR or 
Cholesky factorization. Second, it is possible to terminate the conjugate-gradient iter- 
ations before convergence, and still obtain an approximation of the Newton direction 
suitable for interior-point methods. See the Notes and References for more discussion. 

An example will demonstrate the efficiencies that can be obtained using the tech- 
niques sketched above. The problem is an EVP that we will encounter in §6.2.1. 

We are given matrices Ai, . . . , Al £ R" ", symmetric matrices D\, . . . , Dl, E £ 
R" xn . We consider the EVP 

minimize TV EP 

2.20 
subject to Af P + PA t + D t < 0, i = l,...,L 

In this problem the variable is the matrix P, so the dimension of the optimization 
variable is m = n(n + l)/2. When the Lyapunov inequalities are combined into one 
large LMI F(x) > 0, we find that F(x) £ R NxN with N = Ln. This LMI has much 
structure: It is block-diagonal with each block a Lyapunov inequality. 

Vandenberghe and Boyd have developed a (primal-dual) interior-point method 
that solves (2.20), exploiting the problem structure. They prove the worst-case esti- 
mate of 0(m 2 ' 75 L 15 ) arithmetic operations to solve the problem to a given accuracy. 
In comparison, the ellipsoid method solves (2.20) to a given accuracy in 0(m 35 L) 
arithmetic operations (moreover, the constant hidden in the O(-) notation is much 
larger for the ellipsoid algorithm). 

Numerical experiments on families of problems with randomly generated data 
and families of problems arising in system and control theory show that the actual 
performance of the interior-point method is much better than the worst-case estimate: 
0(m a LP) arithmetic operations, with a w 2.1 and (3 w 1.2. This complexity estimate 
is remarkably small. For example, the cost of solving L Lyapunov equations of the 
same size is 0(m 15 L). Therefore, the relative cost of solving L coupled Lyapunov 
inequalities, compared to the cost of solving L independent Lyapunov inequalities 
is O(m 0e L 02 ). This example illustrates one of our points: The computational cost 
of solving one of the standard problems (which has no "analytic solution") can be 
comparable to the computational cost of evaluating the solution of a similar problem 
that has an "analytic solution" . 



2.5 Strict and Nonstrict LMIs 

We have so far assumed that the optimization problems LMIP, EVP, GEVP, and CP 
involve strict LMIs. We will also encounter these problems with nonstrict LMIs, or 
more generally, with a mixture of strict and nonstrict LMIs. 
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2.5 Strict and Nonstrict LMIs 19 

As an example consider the nonstrict version of the EVP (2.9), i.e. 

minimize c T x 
subject to F(x) > 

Intuition suggests that we could simply solve the strict EVP (by, say, an interior-point 
method) to obtain the solution of the nonstrict EVP. This is correct in most but not 
all cases. 

If the strict LMI F(x) > is feasible, then we have 



{x e R m | F(x) >0} = {xeR m | F{x) > 0}, (2.21) 

i.e., the feasible set of the nonstrict LMI is the closure of the feasible set of the strict 
LMI. It follows that 

inf {c T x | F{x) > } = inf {c T x | F(x) > } . (2.22) 

So in this case, we can solve the strict EVP to obtain a solution of the nonstrict EVP. 
This is true for the problems GEVP and CP as well. 

We will say that the LMI F(x) > is strictly feasible if its strict version is feasible, 
i.e., if there is some xq £ R m such that F(xo) > 0. We have just seen that when an 
LMI is strictly feasible, we can replace nonstrict inequality with strict inequality in the 
problems EVP, GEVP, and CP in order to solve them. In the language of optimization 
theory, the requirement of strict feasibility is a (very strong) constraint qualification. 

When an LMI is feasible but not strictly feasible, (2.21) need not hold, and the 
EVPs with the strict and nonstrict LMIs can be very different. As a simple example, 
consider F(x) = diag(x, —a;) with x € R. The right-hand side of (2.22) is +oo since 
the strict LMI F(x) > is infeasible. The left-hand side, however, is always 0, since 
the LMI F(x) > has the single feasible point x = 0. This example shows one of the 
two pathologies that can occur: The nonstrict inequality contains an implicit equality 
constraint (in contrast with an explicit equality constraint as in §2.1.2). 

The other pathology is demonstrated by the example F{x) = diag(x,0) with 
x € R and c = — 1. Once again, the strict LMI is infeasible so the right-hand side 
of (2.22) is +oo. The feasible set for the nonstrict LMI is the interval [0, oo) so the 
right-hand side is — oo. The problem here is that F(x) is always singular. Of course, 
the nonstrict LMI F(x) > is equivalent (in the sense of defining equal feasible sets) to 
the "reduced" LMI F(x) = x > 0. Note that this reduced LMI satisfies the constraint 
qualification, i.e., is strictly feasible. 

2.5.1 Reduction to a strictly feasible LMI 

It turns out that any feasible nonstrict LMI can be reduced to an equivalent LMI that 
is strictly feasible, by eliminating implicit equality constraints and then reducing the 
resulting LMI by removing any constant nullspace. 

The precise statement is: Let Fq, . . . ,F m G R™ n be symmetric. Then there 
is a matrix A G R m p with p < m, a vector b € R m , and symmetric matrices 
F ,...,F p € R qxq with q < n such that: 

x = Az + b for some z G R p with 



F(x) > ^^ - , A - tA - 

1 ; " F(z)=F + ^2ziFi>0 

i=l 
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where the LMI F(z) > is either infeasible or strictly feasible. See the Notes and 
References for a proof. 

The matrix A and vector b describe the implicit equality constraints for the LMI 
F(x) > 0. Similarly, the LMI F{z) > can be interpreted as the original LMI with 
its constant nullspace removed (see the Notes and References). In most of the prob- 
lems encountered in this book, there are no implicit equality constraints or nontrivial 
common nullspace for F, so we can just take A = I, b = 0, and F = F. 

Using this reduction we can, at least in principle, always deal with strictly feasible 
LMIs. For example we have 



inf {c T x | F(x) > 0} = inf ic T {Az + b) F{z) > o} 
= inf L T (Az + b) F(z)>o\ 



since the LMI F(z) > is either infeasible or strictly feasible. 



2.5.2 Example: Lyapunov inequality 

To illustrate the previous ideas we consider the simplest LMI arising in control theory, 
the Lyapunov inequality: 



A T P- 



PA<Q, P > 0, 



(2.23) 



where A G R x is given, and the symmetric matrix P is the variable. Note that this 
LMI contains a strict inequality as well as a nonstrict inequality. 

We know from system theory that the LMI (2.23) is feasible if and only if all 
trajectories of x = Ax are bounded, or equivalently, if the eigenvalues of A have 
nonpositive real part, and those with zero real part are nondefective, i.e., correspond 
to Jordan blocks of size one. We also know from system theory that the LMI (2.23) is 
strictly feasible if and only if all trajectories of x = Ax converge to zero, or equivalently, 
all the eigenvalues of A have negative real part. 

Consider the interesting case where the LMI (2.23) is feasible but not strictly 
feasible. From the remarks above, we see that by a change of coordinates we can put 
A in the form 



A = T- X AT 



diag 



uj x I kl 











^stab 



L Xfe r 



where < u>\ < ■ ■ ■ < to r , ®k r+1 denotes the zero matrix in R r+1 r+1 , and all the 
eigenvalues of the matrix A sta b € R sxs have negative real part. Roughly speaking, 
we have separated out the stable part of A, the part corresponding to each imaginary 
axis eigenvalue, and the part associated with eigenvalue zero. 
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Using standard methods of Lyapunov theory it can be shown that 
{P | A T P + PA<0, P > 0} = 



•Pr+1, P 



ji-Tprp-1 




stab 



,r 



> . 



>0, i = 1, 



QT ^ 

^-stab-^stab + Pstab^stab < 



P r+ 1 > 0, P st ab > 



From this characterization we can find a reduced LMI that is strictly feasible. We 
can take the symmetric matrices Pi, ... , P r +i, Pstab and the skew-symmetric matrices 
Qi, ■ ■ ■ ,Q r as the "free" variable z; the affine mapping from z into x simply maps 
these matrices into 



P 



diag 



Pi 


Qi 




p r 


Q 


Ql 


Pi 


, . . . , 


QT 


p, 



> "r+l; Ps 



slab 



T-\ 



(2.24) 



Put another way, the equality constraints implicit in the LMI (2.23) are that P must 
have this special structure. 

Now we substitute P in the form (2.24) back into the original LMI (2.23). We 
find that 

A T P +PA = T T diag (0, Af tab P stab + Pstab^stab) T 

where the zero matrix has size 2&i + - • -+2k r + k r+ i. We remove the constant nullspace 
to obtain the reduced version, i.e., ^4^ ab P sta b + Pstab^stab < 0. Thus, the reduced 
LMI corresponding to (2.23) is 



P ^ci 

QI Pi 



>0, i = l,...,r, P r+1 >0, 



(2.25) 



Pstab > 0, j4 stab P sta b + Pstab^l; 



slab 



< 0. 



This reduced LMI is strictly feasible, since we can take Pi, ... , P r +i as identity matri- 
ces, Qi, . . . , Q r as zero matrices, and P s t a b as the solution of the Lyapunov equation 

^stab^stab + Pstab^stab + / = 0. 

In summary, the original LMI (2.23) has one symmetric matrix of size k as variable 
(i.e., the dimension of the original variable x is m = k(k + l)/2). The reduced 
LMI (2.25) has as variable the symmetric matrices Pi, . . . , P r +i, Pstab and the skew- 
symmetric matrices Qi, . . . , Q r , so the dimension of the variable z in the reduced LMI 

is 

The original LMI (2.23) involves a matrix inequality of size n = 2k (i.e., two inequali- 
ties of size k). The reduced LMI (2.25) involves a matrix inequality of size k + s < n. 
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2.6 Miscellaneous Results on Matrix Inequalities 

2.6.1 Elimination of semidefinite terms 

In the LMIP 

find x such that F(x) > 0, (2.26) 

we can eliminate any variables for which the corresponding coefficient in F is semidef- 
inite. Suppose for example that F m > and has rank r < n. Roughly speaking, by 
taking x m extremely large, we can ensure that F is positive-definite on the range of 
F m , so the LMIP reduces to making F positive-definite on the nullspace of F m . 

More precisely, let F m = UU T with U full-rank, and let U be an orthogonal 
complement of U, e.g., U T U = and [U U] is of maximum rank (which in this case 
just means that [U U] is nonsingular) . Of course, we have 



F{x) > «=> \u U\ F{x) \fj U\ > 0. 
Since 



U u\ F(x) \u u\ 



U T F(x)U U T F(x)U 

U T F{x)U U T F{x)U + x m {U T U) 2 



where x = [x\ ■ ■ -x m _i] T and F(x) = Fq + X\F\ + • • • + x TO _i-F m _i, we see that 
F(x) > if and only if U T FU(x) > 0, and x m is large enough that 

U T F{x)U + x m {U T Uf > U T F{£)U (u T F{x)lj) U T F{x)U. 

Therefore, the LMIP (2.26) is equivalent to 

find Xi, . . . , x m _i such that U F(xi, . . . , x m ^\)U > 0. 

2.6.2 Elimination of matrix variables 

When a matrix inequality has some variables that appear in a certain form, we can 
derive an equivalent inequality without those variables. Consider 

G{z) + U{z)XV{z) T + V{z)X T U(z) T > 0, (2.27) 

where the vector z and the matrix X are (independent) variables, and G(z) G R" x ™, 
U(z) and V(z) do not depend on X. Matrix inequalities of the form (2.27) arise in 
the controller synthesis problems described in Chapter 7. 

Suppose that for every z, U(z) and V(z) are orthogonal complements of U(z) 
and V(z) respectively. Then (2.27) holds for some X and z = zq if and only if the 
inequalities 

U{z) T G{z)U{z) > 0, V{z) T G{z)V{z) > (2.28) 

hold with z = Zq. In other words, feasibility of the matrix inequality (2.27) with 
variables X and z is equivalent to the feasibility of (2.28) with variable z; we have 
eliminated the matrix variable X from (2.27) to form (2.28). Note that if U(z) or 
V(z) has rank n for all z, then the first or second inequality in (2.28) disappears. We 
prove this lemma in the Notes and References. 
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2:s 



We can express (2.28) in another form using Finsler's lemma (see the Notes and 
References) : 



G(z) - <rU{z)U{z) T >0, 



G{z)-<tV{z)V{z) t >0 



for some a G R. 

As an example, we will encounter in §7.2.1 the LMIP with LMI 

Q>0, AQ + QA T + BY + Y T B T < 0, (2.29) 

where Q and Y are the variables. This LMIP is equivalent to the LMIP with LMI 

Q > 0, AQ + QA T < <jBB t , 
where the variables are Q and a G R. It is also equivalent to the LMIP 

Q>0, B T (AQ + QA T ) B < 0, 

with variable Q, where B is any matrix of maximum rank such that B T B = 0. Thus 
we have eliminated the variable Y from (2.29) and reduced the size of the matrices in 
the LMI. 

2.6.3 The <S-procedure 

We will often encounter the constraint that some quadratic function (or quadratic 
form) be negative whenever some other quadratic functions (or quadratic forms) are 
all negative. In some cases, this constraint can be expressed as an LMI in the data 
defining the quadratic functions or forms; in other cases, we can form an LMI that is 
a conservative but often useful approximation of the constraint. 



F ■ 



The iS-procedure for quadratic functions and nonstrict inequalities 

Let Fq, . . . , F p be quadratic functions of the variable (" G R": 

F t (C) = C T T t C+2uJC + v l , i = 0,...,p, 
where Tj = Tf . We consider the following condition on F , 

Fq(Q > for all C such that F t (Q > 0, i = l,...,p. 
Obviously if 

there exist n > 0, . . . , t p > such that 
p 

for all C, *b(C) " X>i*H0 > 0, 



(2.30) 



(2.31) 



then (2.30) holds. It is a nontrivial fact that when p = 1, the converse holds, provided 
that there is some £o such that -Fi(Co) > 0. 

Remark: If the functions F t are affine, then (2.31) and (2.30) are equivalent; 
this is the Farkas lemma. 



Note that (2.31) can be written as 
T u 



Uq Vq 



P 

£- 

i=l 



T, 



Ui 



> 0. 
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The <S-procedure for quadratic forms and strict inequalities 



We will use another variation of the <S-procedure, which involves quadratic forms and 
strict inequalities. Let T ,...,T p € R™ x ™ be symmetric matrices. We consider the 
following condition on Tq , . . . , T p : 



( T T ( > for all C ^ such that C T ^C > 0, 



1. 



,p. 



It is obvious that if 

p 
there exists n > 0, . . . , t p > such that Tq — 2, T iTi > 0, 



(2.32) 



(2.33) 



then (2.32) holds. It is a nontrivial fact that when p = 1, the converse holds, provided 
that there is some £o such that Ctf^iCo > 0. Note that (2.33) is an LMI in the variables 
T and n,...,r p . 

Remark: The first version of the S-procedure deals with nonstrict inequalities 
and quadratic functions that may include constant and linear terms. The second 
version deals with strict inequalities and quadratic forms only, i.e., quadratic 
functions without constant or linear terms. 



Remark: Suppose that To, uq and vo depend amnely on some parameter v. Then 
the condition (2.30) is convex in v. This does not, however, mean that the problem 
of checking whether there exists v such that (2.30) holds has low complexity. On 
the other hand, checking whether (2.31) holds for some v is an LMIP in the 
variables v and n, . . . , r p . Therefore, this problem has low complexity. See the 
Notes and References for further discussion. 



5-procedure example 

In Chapter 5 we will encounter the following constraint on the variable P: 



for all £ / and 7r satisfying ir T ir < £, T C T C£,, 



4 

7T 



A 1 P + PA PB 



B T P 







-K 



<0. 



(2.34) 



Applying the second version of the 5-procedure, (2.34) is equivalent to the existence 
of r > such that 



A T P +PA + tC t C PB 
B T P -tI 



<0. 



Thus the problem of finding P > such that (2.34) holds can be expressed as an 
LMIP (in P and the scalar variable r). 

2.7 Some LMI Problems with Analytic Solutions 

There are analytic solutions to several LMI problems of special form, often with im- 
portant system and control theoretic interpretations. We briefly describe some of these 
results in this section. At the same time we introduce and define several important 
terms from system and control theory. 
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2.7.1 Lyapunov's inequality 

We have already mentioned the LMIP associated with Lyapunov's inequality i.e. 

P > 0, A T P + PA<0 

where P is variable and A € R" n is given. Lyapunov showed that this LMI is feasible 
if and only the matrix A is stable, i.e., all trajectories of x = Ax converge to zero as 
t — > oo, or equivalently, all eigenvalues of A must have negative real part. To solve this 
LMIP, we pick any Q > and solve the Lyapunov equation A T P + PA = — Q, which 
is nothing but a set of n(n + l)/2 linear equations for the n(n + l)/2 scalar variables 
in P. This set of linear equations will be solvable and result in P > if and only if 
the LMI is feasible. In fact this procedure not only finds a solution when the LMI is 
feasible; it parametrizes all solutions as Q varies over the positive-definite cone. 

2.7.2 The positive-real lemma 

Another important example is given by the positive-real (PR) lemma, which yields 
a "frequency-domain" interpretation for a certain LMIP, and under some additional 
assumptions, a numerical solution procedure via Riccati equations as well. We give a 
simplified discussion here, and refer the reader to the References for more complete 
statements. 

The LMI considered is: 



P>0, 



A T P +PA PB- C T 
B T P-C -D T -D 



< 0, (2.35) 



where A G R™ ™, B € R™ p , C € R p ™, and D £ R p p are given, and the matrix 
P = P T G R nxn i s t ne variable. (We will encounter a variation on this LMI in §6.3.3.) 
Note that if D + D T > 0, the LMI (2.35) is equivalent to the quadratic matrix 
inequality 

A T P +PA+ {PB - C T ){D + D T )- 1 {PB - C T ) T < 0. (2.36) 

We assume for simplicity that A is stable and the system (A, B, C) is minimal. 

The link with system and control theory is given by the following result. The 
LMI (2.35) is feasible if and only if the linear system 

x = Ax + Bu, y = Cx + Du (2.37) 

is passive, i.e., 

(■T 

u{t) T y{t) dt>0 
o 

for all solutions of (2.37) with x(0) = 0. 

Passivity can also be expressed in terms of the transfer matrix of the linear sys- 
tem (2.37), defined as 

H{s) = C{sl- A)~ l B + D 

for s G C. Passivity is equivalent to the transfer matrix H being positive-real, which 
means that 

H{s) + H{s)* > for all Re s > 0. 
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When p = 1 this condition can be checked by various graphical means, e.g., plotting 
the curve given by the real and imaginary parts of H(iu>) for lu G R (called the Nyquist 
plot of the linear system). Thus we have a graphical, frequency-domain condition for 
feasibility of the LMI (2.35) . This approach was used in much of the work in the 1960s 
and 1970s described in §1.2. 

With a few further technical assumptions, including D + D T > 0, the LMI (2.35) 
can be solved by a method based on Riccati equations and Hamiltonian matrices. 
With these assumptions the LMI (2.35) is feasible if and only if there exists a real 
matrix P = P T satisfying the ARE 



A T P- 



PA + {PB - C 1 ){D + D 1 )~\PB - C 



iT\T 



0, 



(2.38) 



which is just the quadratic matrix inequality (2.36) with equality substituted for in- 
equality. Note that P > 0. 

To solve the ARE (2.38) we first form the associated Hamiltonian matrix 



M 



A-B{D + D T )- l C 
-Cf r {D + D T )- 1 C 



B{D + D T )- l B T 
- A T + C T {D + D T )- l B T 



Then the system (2.37) is passive, or equivalently, the LMI (2.35) is feasible, if and 
only if M has no pure imaginary eigenvalues. This fact can be used to form a Routh- 
Hurwitz (Sturm) type test for passivity as a set of polynomial inequalities in the data 
A, B, C, and D. 

When M has no pure imaginary eigenvalues we can construct a solution P aro as 
follows. Pick V € R " xn so that its range is a basis for the stable eigenspace of 
M, e.g., V = [vi ■ ■ ■ v n ] where v±, . . . , v n are a set of independent eigenvectors of M 
associated with its n eigenvalues with negative real part. Partition V as 



V 



v 2 



where V\ and V2 are square; then set P arc = V2V1 . The solution P arc thus obtained 
is the minimal element among the set of solutions of (2.38): if P = P T satisfies (2.38), 
then P > -P are . Much more discussion of this method, including the precise technical 
conditions, can be found in the References. 



2.7.3 The bounded-real lemma 

The same results appear in another important form, the bounded-real lemma. Here 
we consider the LMI 



P>0, 



A T P +PA+ C T C PB + C T D 
B T P + D T C D T D-I 



< 0. 



(2.39) 



where A e R" x ", B G R nxp , C e R px ™, and D E R pxp are given, and the matrix 
P = P T € j^«x™ j g ^g variable. For simplicity we assume that A is stable and 
(A, B, C) is minimal. 

This LMI is feasible if and only the linear system (2.37) is nonexpansive, i.e., 

y(t) T y{t) dt< u(t) T u{t) dt 
Jo 
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for all solutions of (2.37) with a;(0) = 0, This condition can also be expressed in 
terms of the transfer matrix H. Nonexpansivity is equivalent to the transfer matrix 
H satisfying the bounded-real condition, i.e., 

H{s)*H{s)<I for all Res>0. 

This is sometimes expressed as ||-ff||oo < 1 where 

||H||oo = sup{ \\H(s)\\ | Re S >0} 

is called the Hqo norm of the transfer matrix H. 

This condition is easily checked graphically, e.g., by plotting \\H (iu>)\\ versus wgR 
(called a singular value plot for p > 1 and a Bode magnitude plot for p = 1). 

Once again we can relate the LMI (2.39) to an ARE. With some appropriate tech- 
nical conditions (see the Notes and References) including D T D < I, the LMI (2.39) 
is feasible if and only the ARE 

A T P + PA + C T C + {PB + C T D){I - D T D)-\PB + C T D) T = (2.40) 

has a real solution P = P T . Once again this is the quadratic matrix inequality 
associated with the LMI (2.39), with equality instead of the inequality. 
We can solve this equation by forming the Hamiltonian matrix 

A + B{I-D T D)- 1 D T C B{I-D T D)- 1 B T 

-C T (I-DD T )- 1 C -A T -C T D(I-D T D)- 1 B T 



M 



Then the system (2.37) is nonexpansive, or equivalently, the LMI (2.39) is feasible, 
if and only if M has no imaginary eigenvalues. In this case we can find the minimal 
solution to (2.40) as described in §2.7.2. 

2.7.4 Others 

Several other LMI problems have analytic solutions, e.g., the ones that arise in the 
synthesis of state-feedback for linear systems, or estimator gains for observers. As a 
simple example consider the LMI 

P > 0, AP + PA T < BB T 

where A e R" x ™ and B e R™ xp are given and P is the variable. This LMI is feasible 
if and only if the pair (A, B) is stabilizable, i.e., there exists a matrix K 6 R px ™ such 
that A + BK is stable. There are simple methods for determining whether this is the 
case, and when it is, of constructing an appropriate P. We will describe this and other 
results in Chapter 7. 

Several standard results from linear algebra can also be interpreted as analytic 
solutions to certain LMI problems of special form. For example, it is well-known 
that among all symmetric positive-definite matrices with given diagonal elements, 
the diagonal matrix has maximum determinant. This gives an analytic solution to a 
special (and fairly trivial) CP (see §3.5). 

Notes and References 

LMIs 

The term "Linear Matrix Inequality" was coined by J. C. Willems and is widely used now. 
As we mentioned in §1.2, the term was used in several papers from the 1970s to refer to the 
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specific LMI (1.3). The term is also consistent with the title of an early paper by Bellman 
and Ky Fan: On Systems of Linear Inequalities in Hermitian Matrix Variables [BF63] (see 
below). 

A more accurate term might be "Affine Matrix Inequality" , since the matrix is an affine 
and not linear function of the variable. We think of the term "Linear Matrix Inequality" as 
analogous to the term "Linear Inequalities" used to describe a i x<bi,i = l,...,p. 

When m — 1, an LMI is nothing but a matrix pencil. The theory of matrix pencils is very old 
and quite complete, and there is an extensive literature (see, e.g., Golub and Van Loan [GL89] 
and the references therein). For the case of m — 1 all of our standard problems are readily 
solved, e.g., by simultaneous diagonalization of Fo and Fi by an appropriate congruence. 
The case m = 1 does not arise on its own in any interesting problem from system or control 
theory, but it does arise in the "line-search (sub)problem", i.e., when we restrict one of our 
standard problems to a specific line. In this context the variable represents the "step length" 
to be taken in an iteration of an algorithm, and it useful to know that such problems can be 
solved extremely efficiently. 

Schur complements for nonstrict inequalities 

The Schur complement result of section §2.1 can be generalized to nonstrict inequalities as 
follows. Suppose Q and R are symmetric. The condition 



Q S 

S T R 



> 



(2.41) 



is equivalent to 
R> 0, 



SR t S T >0, S{I-RR 1 ) = 0, 



where R? denotes the Moore-Penrose inverse of R. 

To see this, let U be an orthogonal matrix that diagonalizes R, so that 



U T RU = 



E 




where E > and diagonal. Inequality (2.41) holds if and only if 



/ 

U T 



R 



I 

u 



Q Si S 2 
Sf E 



s 







>o, 



where [Si S2} ~ SU, with appropriate partitioning. We must then have S2 = 0, which holds 
if and only if S(I - RR ] ) = 0, and 



Si 



Si 



>o, 



which holds if and only if ( 



SR^S T > 0. 



Formulating convex problems in terms of LMIs 

The idea that LMIs can be used to represent a wide variety of convex constraints can be 
found in Nesterov and Nemirovskii's report [NN90b] (which is really a preliminary version 
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of their book [NN94]) and software manual [NN90a]. They formalize the idea of a "positive- 
definite representable" function; see §5.3, §5.4, and §6.4 of their book [NN94]. The idea 
is also discussed in the article [Ali92b] and thesis [Ali91] of Alizadeh. In [BE93], Boyd 
and El Ghaoui give a list of quasiconvex functions that can be represented as generalized 
eigenvalues of matrices that depend afiinely on the variable. 



Infeasibility criterion for LMIs 

The LMI F(x) > is infeasible means the affine set {F(x) | x £ R m } does not intersect the 
positive-definite cone. From convex analysis, this is equivalent to the existence of a linear 
functional ip that is positive on the positive-definite cone and nonpositive on the affine set 
of matrices. The linear functionals that are positive on the positive-definite cone are of the 
form ip(F) = Tr GF, where G > and G/0. From the fact that tf> is nonpositive on the 
affine set {F(x)\x £ R m }, we can conclude that Tr GF = 0, i = 1, . . . ,m and Tr GF < 0. 
These are precisely the conditions for infeasibility of an LMI that we mentioned in §2.2.1. 

For the special case of multiple Lyapunov inequalities, these conditions are given Bellman 
and Ky Fan [BF63] and Kamenetskii and Pyatnitskii [KP87A, KP87b]. 

It is straightforward to derive optimality criteria for the other problems, using convex anal- 
ysis. Some general references for convex analysis are the books [ROC70, ROC82] and survey 
article [Roc93] by Rockafellar. The recent text [HUL93] gives a good overview of convex 
analysis; LMIs are used as examples in several places. 



Complexity of convex optimization 

The important role of convexity in optimization is fairly widely known, but perhaps not 
well enough appreciated, at least outside the former Soviet Union. In a standard (Western) 
treatment of optimization, our standard problems LMIP, EVP, GEVP, and CP would be 
considered very difficult since they are nondifferentiable and nonlinear. Their convexity 
properties, however, make them tractable, both in theory and in practice. 

In [ROC93, P194], Rockafellar makes the important point: 

One distinguishing idea which dominates many issues in optimization theory is 
convexity . . . An important reason is the fact that when a convex function is 
minimized over a convex set every locally optimal solution is global. Also, first- 
order necessary conditions turn out to be sufficient. A variety of other properties 
conducive to computation and interpretation of solutions ride on convexity as 
well. In fact the great watershed in optimization isn't between linearity and 
nonlinearity, but convexity and nonconvexity. 

Detailed discussion of the (low) complexity of convex optimization problems can be found 
in the books by Grotschel, Lovasz, Schrijver [GLS88], Nemirovskii and Yudin [NY83], and 
Vavasis [Vav91]. 

Complete and detailed worst-case complexity analyses of several algorithms for our standard 
problems can be found in Chapters 3, 4, and 6 of Nesterov and Nemirovskii [NN94]. 



Ellipsoid algorithm 

The ellipsoid algorithm was developed by Shor, Nemirovskii, and Yudin in the 1970s [Sho85, 
NY83]. It was used by Khachiyan in 1979 to prove that linear programs can be solved in 
polynomial-time [Kha79, GL81, GLS88]. Discussion of the ellipsoid method, as well as 
several extensions and variations, can be found in [BGT81] and [BB91, CH14]. A detailed 
history of its development, including English and Russian references, appears in chapter 3 of 
Akgiil [AKG84]. 
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Optimization problems involving LMIs 

One of the earliest papers on LMIs is also, in our opinion, one of the best: On systems of 
linear inequalities in Hermitian matrix variables, by Bellman and Ky Fan [BF63]. In this 
paper we find many results, e.g., a duality theory for the multiple Lyapunov inequality EVP 
and a theorem of the alternative for the multiple Lyapunov inequality LMIP. The paper 
concentrates on the associated mathematics: there is no discussion of how to solve such 
problems, or any potential applications, although they do comment that such inequality 
systems arise in Lyapunov theory. Given Bellman's legendary knowledge of system and 
control theory, and especially Lyapunov theory, it is tempting to conjecture that Bellman 
was aware of the potential applications. But so far we have found no evidence for this. 

Relevant work includes Cullum et al. [CDW75], Craven and Mond [CM81], Polak and 
Wardi [PW82], Fletcher [Fle85], Shapiro [Sha85], Friedland et al. [FN087], Goh and 
Teo [GT88], Panier [Pan89], Allwright [All89], Overton and Womersley [Ove88, Ove92, 
OW93, OW92], Ringertz [Rin91], Fan and Nekooie [FN92, Fan93] and Hiriart-Urruty and 
Ye [HUY92]. 

LMIPs are solved using various algorithms for convex optimization in Boyd and Yang [BY89], 
Pyatnitskii and Skorodinskii [PS83], Kamenetskii and Pyatnitskii [KP87A, KP87b]. 

A survey of methods for solving problems involving LMIs used by researchers in control theory 
can be found in the paper by Beck [Bec91]. The software packages [BDG + 91] and [CS92a] 
use convex programming to solve many robust control problems. Boyd [Boy94] outlines how 
interior-point convex optimization algorithms can be used to build robust control software 
tools. Convex optimization problems involving LMIs have been used in control theory since 
about 1985: Gilbert's method [Gil66] was used by Doyle [Doy82] to solve a diagonal scaling 
problem; another example is the MUSOL program of Fan and Tits [FT86]. 

Among other methods used in control to solve problems involving multiple LMIs is the 
"method of alternating convex projections" described in the article by Grigoriadis and Skel- 
ton [GS92]. This method is essentially a relaxation method, and requires an analytic ex- 
pression for the projection onto the feasible set of each LMI. It is not a polynomial-time 
algorithm; its complexity depends on the problem data, unlike the ellipsoid method or the 
interior-point methods described in [NN94], whose complexities depend on the problem size 
only. However, the alternating projections method is reported to work well in many cases. 

Interior-point methods for LMI problems 

Interior-point methods for various LMI problems have recently been developed by several re- 
searchers. The first were Nesterov and Nemirovskii [NN88, NN90B, NN90A, NN91A, NN94, 
NN93]; others include Alizadeh [Ali92b, Ali91, Ali92a], Jarre [Jar.93c], Vandenberghe and 
Boyd [VB93b], Rendl, Vanderbei, and Wolkowocz [RVW93], and Yoshise [YOS94]. 

Of course, general interior-point methods (and the method of centers in particular) have a 
long history. Early work includes the book by Fiacco and McCormick [FM68], the method 
of centers described by Huard et al. [LH66, Hua67], and Dikin's interior-point method 
for linear programming [Dik67]. Interest in interior-point methods surged in 1984 when 
Karmarkar [Kar84] gave his interior-point method for solving linear programs, which appears 
to have very good practical performance as well as a good worst-case complexity bound. Since 
the publication of Karmarkar's paper, many researchers have studied interior-point methods 
for linear and quadratic programming. These methods are often described in such a way 
that extensions to more general (convex) constraints and objectives are not clear. However, 
Nesterov and Nemirovskii have developed a theory of interior-point methods that applies to 
more general convex programming problems, and in particular, every problem that arises 
in this book; see [NN94]. In particular, they derive complexity bounds for many different 
interior-point algorithms, including the method of centers, Nemirovskii's projective algorithm 
and primal-dual methods. The most efficient algorithms seem to be Nemirovskii's projective 
algorithm and primal-dual methods (for the case of linear programs, see [Meh91]). 

Other recent articles that consider interior-point methods for more general convex program- 
ming include Sonnevend [Son88], Jarre [Jar91, Jar.93b], Kortanek et al. [KPY91], Den 
Hertog, Roos, and Terlaky [DRT92], and the survey by Wright [Wri92]. 
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Interior-point methods for GEVPs are described in Boyd and El Ghaoui [BE93] (variation 
on the method of centers), and Nesterov and Nemirovskii [NN91b] and [NN94, §4.4] (a 
variation on Nemirovskii's projective algorithm). Since GEVPs are not convex problems, 
devising a reliable stopping criterion is more challenging than for the convex problems LMIP, 
EVP, and CP. A detailed complexity analysis (in particular, a statement and proof of the 
polynomial-time complexity of GEVPs) is given in [NN91B, NN94]. See also [Jar93a]. 

Several researchers have recently studied the possibility of switching from an interior-point 
method to a quadratically convergent local method in order to improve on the final conver- 
gence; see [Ove92, OW93, OW92, FN92, NF92]. 

Interior-point methods for CPs and other related extremal ellipsoid volume problems can be 
found in [NN94, 56.51. 



Interior-point methods and problem structure 

Many researchers have developed algorithms that take advantage of the special structure 
of the least-squares problems arising in interior-point methods for linear programming. As 
far as we know, the first (and so far, only) interior-point algorithm that takes advantage 
of the special (Lyapunov) structure of an LMI problem arising in control is described in 
Vandenberghe and Boyd [VB93B, VB93a]. Nemirovskii's projective method can also take 
advantage of such structure; these two algorithms appear to be the most efficient algorithms 
developed so far for solving the LMIs that arise in control theory. 



Software for solving LMI problems 

Gahinet and Nemirovskii have recently developed a software package called LMI-Lab [GN93] 
based on an earlier FORTRAN code [NN90a], which allows the user to describe an LMI 
problem in a high-level symbolic form (not unlike the formulas that appear throughout this 
book!). LMI-Lab then solves the problem using Nemirovskii's projective algorithm, taking 
advantage of some of the problem structure (e.g., block structure, diagonal structure of some 
of the matrix variables). 

Recently, El Ghaoui has developed another software package for solving LMI problems. This 
noncommercial package, called LMI-tool, can be used with matlab. It is available via anony- 
mous ftp (for more information, send mail to elghaoui@ensta.fr). Another version of LMI- 
tool, developed by Nikoukhah and Delebecque, is available for use with the MATLAB-like 
freeware package SCILAB; in this version, LMI-tool has been interfaced with Nemirovskii's 
projective code. SCILAB can be obtained via anonymous ftp (for more information, send mail 
to Scilab@inria.fr). 

A commercial software package that solves a few specialized control system analysis and 
design problems via LMI formulation, called OPTIN, was recently developed by Olas and 
Associates; see [OS93, Ola94]. 



Reduction to a strictly feasible LMI 

In this section we prove the following statement. Let F Q ,...,F m £ r/ 1 *™ De symmetric 
matrices. Then there is a matrix A € R mxp with p < m, a vector b £ R m , and symmetric 
matrices Fq, . . . , F p g R <3X<3 with q < n such that: 



F(x) > if and only if x = Az + b for some z£R' and F(z) = F + V" z t F % > 0. 

1 = 1 

In addition, if the LMI F(x) > is feasible, then the LMI F(z) > is strictly feasible. 
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Consider the LMI 

m 

F(x) =F + ^2 x i p i > °- ( 2 - 42 ) 

where Fi G R" xn , i — 0,...,m. Let X denote the feasible set {x | F(x) > 0}. 

If X is empty, there is nothing to prove; we can take p — m, A — I, b — 0, Fo = Fq, . . . , F p — 

Henceforth we assume that X is nonempty. If X is a singleton, say X = {a^o}, then with 
p — 1, A — 0, b = Xo, Fo = 1, and F\ — 0, the statement follows. 

Now consider the case when X is neither empty nor a singleton. Then, there is an affine 
subspace A of minimal dimension p > 1 that contains X. Let Oi, . . . a p a basis for the linear 
part of A. Then every x £ A can be written as x — Az + b where A — [ai ■ ■ ■ a p ] is full-rank 
and fee R m . Defining G =F(b),Gi = F(a x )-F ,i = 1, . . . ,p, and G{z) = Go + ELi ZiGi ' 
we see that ^(a;) > if and only if there exists z € R p satisfying x — Az + b and G(z) > 0. 

Let Z = {z G R p | G(z) > 0}. By construction, Z has nonempty interior. Let 20 a point in 
the interior of Z and let v lie in the nullspace of G(zo). Now, v T G(z)v is a nonnegative affine 
function of z £ Z and is zero at an interior point of Z. Therefore, it is identically zero over 
Z, and hence over R p . Therefore, v belongs to the intersection B of the nullspaces of the 
Gi, i — 0, . . . ,p. Conversely, any 11 belonging to B will satisfy v T G(z)v — for any z £ R p . 
B may therefore be interpreted as the "constant" nullspace of the LMI (2.42). Let q be the 
dimension of B. 

If q — n (i.e., Go = • • • = G p = 0), then obviously F(x) > if and only if x = Az + b. In this 
case, the statement is satisfied with Fo — I £ R pxp , F, = 0, i = 1, . . . ,p. 

If q < n, let Vi,...,v q a basis for B. Complete the basis vi,...,v q by xiq+i, . . . ,V n to 
obtain a basis of R". Define U = [v q +i ■ ■ ■ v„], Fi — U T GiU, i = 1, . . . ,p and F(z) — 
F + ]T P =1 z;F. Then, the LMI G(z) > is equivalent to the LMI F(z) > 0, and by 
construction, there exists zo such that F(zo) > 0. This concludes the proof. 

Elimination procedure for matrix variables 

We now prove the matrix elimination result stated in §2.6.2: Given G, U, V, there exists X 
such that 

G + UXV T + VX T U T >0 (2.43) 

if and only if 

U T GU > 0, V T GV > (2.44) 

holds, where U and V are orthogonal complements of U and V respectively. 

It is obvious that if (2.43) holds for some X, so do inequalities (2.44). Let us now prove the 
converse. 

Suppose that inequalities (2.44) are feasible. Suppose now that inequality (2.43) is not feasible 
for any X. In other words, suppose that 

G + UXV T + VX T U T ? 0, 

for every X . By duality, this is equivalent to the condition 

there exists Z ^0 with Z > 0, V T ZU = 0, and Tr GZ < 0. (2.45) 
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Now let us show that Z > and V T ZU — imply that 

Z = VHH T V T + UKK T U T (2.46) 

for some matrices H, K (at least one of which is nonzero, since otherwise Z — 0). This 
will finish our proof, since (2.44) implies that TvGZ > for Z of the form (2.46), which 
contradicts Tr GZ < in (2.45). 

Let R be a Cholesky factor of Z, i.e. Z = RR T . The condition V T ZU = is equivalent to 

V T RR T U = (V T R)(U T R) T = 0. 
This means that there exists a unitary matrix T, and matrices M and N such that 

V T RT = [0 M] and U T RT = [N 0] , 

where the number of columns of M and N add up to that of R. In other words, RT can be 
written as 

RT = [A S] 

for some matrices A, B such that V A — 0, [/ T B = 0. From the definition of V and U. 
matrices A, B can be written A — VH, B — UK for some matrices H, K, and we have 

Z = RR T = (RT)(RT) T = AA T + BB T = VHH T V T + UKK T U T , 

which is the desired result. 

The equivalence between the conditions 



and 



U{z) T G(z)U{z) > 



G(z) — aU(z)U(z) > for some real scalar a 



is through Finsler's lemma [Fin37] (see also §3.2.6 of [Sch73]), which states that if x T Qx > 
for all nonzero x such that x Ax — 0, where Q and A are symmetric, real matrices, then 
there exists a real scalar such that Q — a A > 0. 

Finsler's lemma has also been directly used to eliminate variables in certain matrix inequali- 
ties (see for example [PH86, KR88, BPG89b]); it is closely related to the 5-procedure. We 
also note another result on elimination of matrix variables, due to Parrott [Par78] . 

The elimination lemma is related to a matrix dilation problem considered in [DKW82], which 
was used in control problems in [PZPB91, PZP + 92, Gah92, Pac94, IS93A, Iwa93] (see 
also the Notes and References of Chapter 7). 

The 5-procedure 

The problem of determining if a quadratic form is nonnegative when other quadratic forms are 
nonnegative has been studied by mathematicians for at least seventy years. For a complete 
discussion and references, we refer the reader to the survey article by Uhlig [Uhl79]. See also 
the book by Hestenes [Hes81, P354-360] and Horn and Johnson [HJ91, P78-86] for proofs 
of various 5-procedure results. 

The application of the 5-procedure to control problems dates back to 1944, when Lur'e 
and Postnikov used it to prove the stability of some particular nonlinear systems. The 
name "5-procedure" was introduced much later by Aizerman and Gantmacher in their 1964 
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book [AG64]; since then Yakubovich has given more general formulations and corrected 
errors in some earlier proofs [Yak77, FY79]. 

The proof of the two S-procedure results described in this chapter can be found in the 
articles by Yakubovich [FY79, Yak77, Yak73]. Recent and important developments about 
the 5-procedure can be found in [Yak92] and references therein. 

Complexity of 5-procedure condition 

Suppose that To, uq, and Vo depend affinely on some variable v\ the condition (2.30) is 
convex in v. Indeed, for fixed £, the constraint -Fo(C) > is a linear constraint on v and the 
constraint (2.30) is simply an infinite number of these linear constraints. We might therefore 
imagine that the problem of checking whether there exists v such that (2.30) holds has low 
complexity since it can be cast as a convex feasibility problem. This is wrong. In fact, merely 
verifying that the condition (2.30) holds for fixed data Ti, Ui, and Vi, is as hard as solving 
a general indefinite quadratic program, which is NP-complete. We could determine whether 
there exists v such that (2.30) holds with polynomially many steps of the ellipsoid algorithm; 
the problem is that finding a cutting plane (which is required for each step of the ellipsoid 
algorithm) is itself an NP-complete problem. 

Positive-real and bounded-real lemma 

In 1961, Popov gave the famous Popov frequency-domain stability criterion for the absolute 
stability problem [Pop62]. Popov's criterion could be checked via graphical means, by verify- 
ing that the Nyquist plot of the "linear part" of the nonlinear system was confined to a specific 
region in the complex plane. Yakubovich [Yak62, Yak64] and Kalman [Kal63a, Kal63b] 
established the connection between the Popov criterion and the existence of a positive- 
definite matrix satisfying certain matrix inequalities. The PR lemma is also known by various 
names such as the Yakubovich-Kalman-Popov- Anderson Lemma or the Kalman- Yakubovich 
Lemma. The PR lemma is now standard material, described in several books on control and 
systems theory, e.g., Narendra and Taylor [NT73], Vidyasagar [Vid92, PP474-478], Faurre 
and Depeyrot [FD77], Anderson and Vongpanitlerd [AV73, CH5-7], Brockett [Bro70], and 
Willems [Wil70]. 

In its original form, the PR Lemma states that the transfer function c(sl — A)~ l b of the 
single-input single-output minimal system (^4, b, c) is positive-real, i.e. 

Rec(s/- A) _1 fe> for all Res >0 (2.47) 

if and only if there exists P > such that A T P + PA < and Pb = c T . The condition (2.47) 
can be checked graphically. 

Anderson [And67, And66b, And73] extended the PR lemma to multi-input multi-output 
systems, and derived similar results for nonexpansive systems [And66a]. Willems [Wil71b, 
Wil74a] described connections between the PR lemma, certain quadratic optimal control 
problems and the existence of symmetric solutions to the ARE; it is in [Wil71b] that we find 
Willems' quote on the role of LMIs in quadratic optimal control (see page 3). 

The connections between passivity, LMIs and AREs can be summarized as follows. We 
consider A, B, C, and D such that all eigenvalues of A have negative real part, (A, B) is 
controllable and D + D T > 0. The following statements are equivalent: 

1. The system 

x — Ax + Bu, y = Cx + Du, x(0) - 

is passive, i.e., satisfies 

u(t) T y(t)dt > 
'o 

for all u and T > 0. 
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2. The transfer matrix H(s) — C(sl — A) 1 B + D is positive-real, i.e., 

H(s) + H(s)* > 
for all s with Re s > 0. 

3. The LMI 



< 



A T P + PA PB- C T 
B T P-C -(D + D T ) 

in the variable P — P T is feasible. 
4. There exists P = P T satisfying the ARE 

A T P + PA + (PB - C T )(D + D T )~ 1 (PB - 



C 1 



(2.48) 



(2.49) 



5. The sizes of the Jordan blocks corresponding to the pure imaginary eigenvalues of the 
Hamiltonian matrix 



M = 

are all even. 



A-B{D + D T )- 1 C 
-C T {D + D T )- l C 



B{D + D T )- 1 B T 
-A T +C T {D + D T )- 1 B 1 



The equivalence of 1, 2, 3, and 4 can be found in Theorem 4 of [Wil71b] and also [Wil74a]. 
The equivalence of 5 is found in, for example, Theorem 1 of [LR80]. Origins of this result can 
be traced back to Reid [Rei46] and Levin [Lev59]; it is explicitly stated in Potter [Pot66]. 
An excellent discussion of this result and its connections with spectral factorization theory 
can be found in [AV73]; see also [Fra87, BBK89, Rob89]. Connections between the ex- 
tremal points of the set of solutions of the LMI (2.48) and the solutions of the ARE (2.49) 
are explored in [Bad82]. The method for constructing P are from the stable eigenspace of 
M, described in §2.7.2, is in [BBK89]; this method is an obvious variation of Laub's algo- 
rithm [Lau79, AL84], or Van Dooren's [D008I]. A less stable numerical method can be found 
in [Rei46, Lev59, Pot66, AV73]. For other discussions of strict passivity, see the articles 
by Wen [Wen88], Lozano-Leal and Joshi [LLJ90]. It is also possible to check passivity (or 
nonexpansivity) of a system symbolically using Sturm methods, which yields Routh-Hurwitz 
like algorithms; see Siljak [Sil71, Sil73] and Boyd, Balakrishnan, and Kabamba [BBK89]. 

The PR lemma is used in many areas, e.g., interconnected systems [MH78] and stochastic 
processes and stochastic realization theory [Fau67, Fau73, AM74]. 
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Chapter 3 

Some Matrix Problems 

3.1 Minimizing Condition Number by Scaling 

The condition number of a matrix M £ R p q , with p > q, is the ratio of its largest 
and smallest singular values, i.e., 

A /A max (Af T M)\ 1/2 



K(M) = U in (M-M)J 



for M full-rank, and k(M) = oo otherwise. For square invertible matrices this reduces 
to k{M) = \\M\\\\M- 1 \\. 

We consider the problem: 

minimize n (LMR) 

subject to L £ R p p , diagonal and nonsingular (3-1) 

R £ ~R qxq , diagonal and nonsingular 

where L and R are the optimization variables, and the matrix M £ R pX9 is given. 
We will show that this problem can be transformed into a GEVP. We assume without 
loss of generality that p > q and M is full-rank. 

Let us fix 7 > 1. There exist nonsingular, diagonal L £ R pxp and R £ R qxq 
such that k(LMR) < 7 if and only if there are nonsingular, diagonal L £ R pxp and 
R £ R qxq and fi > such that 

fil < {LMR) T {LMR) < fi-/ 2 I. 

Since we can absorb the factor l/y^ into L, this is equivalent to the existence of 
nonsingular, diagonal L £ R pxp and R £ R qxq such that 

/ < {LMR) T {LMR) < 7 2 /, 

which is the same as 

{RR T y l < M T {L T L)M < 1 2 {RR T y 1 . (3.2) 

This is equivalent to the existence of diagonal P £ K pxp , Q £ K qxq with P > 0, 
Q > and 

Q < M T PM < 7 2 Q. (3.3) 
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To see this, first suppose that L G R pxp and R G ~R qxq are nonsingular and diagonal, 
and (3.2) holds. Then (3.3) holds with P = L T L and Q = (RR T )- 1 . Conversely, 
suppose that (3.3) holds for diagonal P € R pxp and Q e R ,x ' with P > 0, Q > 0. 
Then, (3.2) holds for L = P 1 ! 2 and R = Q~ x l 2 . 

Hence we can solve (3.1) by solving the GEVP: 

minimize j 2 

subject to P e K pxp and diagonal, P > 0, 

Q G R qxq and diagonal, Q > 

Q < M T PM < 7 2 Q 



Remark: This result is readily extended to handle scaling matrices that have 
a given block-diagonal structure, and more generally, the constraint that one or 
more blocks are equal. Complex matrices are readily handled by substituting M* 
(complex-conjugate transpose) for M T . 



3.2 Minimizing Condition Number of a 
Positive-Definite Matrix 

A related problem is minimizing the condition number of a positive-definite matrix 
M that depends affinely on the variable x, subject to the LMI constraint F(x) > 0. 
This problem can be reformulated as the GEVP 



(3.4) 



minimize 7 

subject to F(x) > 0, \i > 0, \xl < M(x) < j/j,I 

(the variables here are x, fi, and 7). 

We can reformulate this GEVP as an EVP as follows. Suppose 

m m 

M{x) = M + Y J x * M *i F(x)=F + ^2xiFi. 

i=i i=i 

Defining the new variables v = 1/ fi, x = x/ fi, we can express (3.4) as the EVP with 
variables v = l//z, x = x/fJ, and 7: 

minimize 7 

m m 

subject to vF + y^ %iFi > °> I < vM o + X^ %i M i < T^ 

8 = 1 t=l 

3.3 Minimizing Norm by Scaling 

The optimal diagonally scaled norm of a matrix M G C" ™ is defined as 
v{M) = inf { ||DMD _1 || J D G C" x ™ is diagonal and nonsingular} , 

where ||M|| is the norm (i.e., largest singular value) of the matrix M. We will show 
that v(M) can be computed by solving a GEVP. 
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Note that 

, s f (DMD- 1 )* (DMD- 1 ) <7 2 / ] 

v(M) = inf {7 V ' V ; ' ^ 

I for some diagonal, nonsingular D I 

= inf {7 M* D* DM < 7 D* D for some diagonal, nonsingular Dj 

= inf {7 J M*PM < -f 2 P for some diagonal P = P T > 0} . 

Therefore v{M) is the optimal value of the GEVP 

minimize 7 

subject to P > and diagonal, M*PM < ~/ 2 P 



Remark: This result can be extended in many ways. For example, the more 
general case of block-diagonal similarity-scaling, with equality constraints among 
the blocks, is readily handled. As another example, we can solve the problem of 
simultaneous optimal diagonal scaling of several matrices Mi, . . . , M p : To com- 
pute 




\dm 1 d~ 1 \ 



D £ C" x " is diagonal 
and nonsingular 



we simply compute the optimal value of the GEVP 

minimize 7 

subject to P > and diagonal, M'PMi < 7 2 P, i — l,...,p 

Another closely related quantity, which plays an important role in the stability 
analysis of linear systems with uncertain parameters, is 



P>0, M'PM + i(M*G-GM) <7 2 P, , N 

> , (3.5) 

P and G diagonal and real | 



inf < 7 
which can also be computed by solving a GEVP. 



3.4 Rescaling a Matrix Positive-Definite 

We are given a matrix M G Q nxn ^ an( j g^ whether there is a diagonal matrix D > 
such that the Hermitian part of DM is positive-definite. This is true if and only if 
the LMI 

M*D + DM > 0, D > (3.6) 

is feasible. So determining whether a matrix can be rescaled positive-definite is an 
LMIP. 

When this LMI is feasible, we can find a feasible scaling D with the smallest 
condition number by solving the EVP 

minimize /! 

subject to D diagonal, M*D + DM > 0, I < D < fi 2 I 
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Remark: The LMI (3.6) can also be interpreted as the condition for the existence 
of a diagonal quadratic Lyapunov function that proves stability of — M; see the 
Notes and References and §10.3. 



3.5 Matrix Completion Problems 



In a matrix completion problem some entries of a matrix are fixed and the others are 
to be chosen so the resulting ("completed") matrix has some property, e.g., is Toeplitz 
and positive-definite. Many of these problems can be cast as LMIPs. The simplest 
example is positive-definite completion: Some entries of a symmetric matrix are given, 
and we are to complete the matrix to make it positive-definite. Evidently this is an 
LMIP. 

We can easily recover some known results about this problem. We can assume that 
the diagonal entries are specified (otherwise we simply make them very large, i.e., elim- 
inate them; see §2.6.1). Then the set of all positive-definite completions, if nonempty, 
is bounded. Hence the associated LMI has an analytic center A* (see §2.4.1). The 
matrix A* is characterized by Tr A*~ B = for every B = B T that has zero entries 
where A has specified entries (see §2.4.2). Thus the matrix A*^ 1 has a zero entry 
in every location corresponding to an unspecified entry in the original matrix, i.e., 
it has the same sparsity structure as the original matrix. Depending on the pattern 
of specified entries, this condition can lead to an "analytic solution" of the problem. 
But of course an arbitrary positive-definite completion problem is readily solved as an 
LMIP 

Let us mention a few simple generalizations of this matrix completion problem 
that have not been considered in the literature but are readily solved as LMIPs, and 
might be useful in applications. 

Suppose that for each entry of the matrix we specify an upper and lower bound, 
in other words, we consider an "interval matrix" . For entries that are completely 
specified, the lower and upper bound coincide; for entries that are completely free 
the lower bound is — oo and the upper bound is +oo. The problem of completing 
the matrix, subject to the bounds on each entry, to make it positive-definite (or a 
contraction, i.e., have norm less than one) is readily formulated as an LMIP. 

As another example, suppose we are given a set of matrices that have some given 
sparsity pattern. We want to determine a matrix with the complementary sparsity 
pattern that, when added to each of the given matrices, results in a positive-definite 
(or contractive, etc.) matrix. We might call this the "simultaneous matrix completion 
problem" since we seek a completion that serves for multiple original matrices. This 
problem is also readily cast as an LMIP. 

As a more specific example, suppose we are given upper and lower bounds for 
some of the correlation coefficients of a stationary time-series, and must determine 
some complementary correlation coefficients that are consistent with them. In other 
words, we have a Toeplitz matrix with some entries within given intervals, while the 
others are completely free (unspecified). We must determine these free entries so that 
the Toeplitz matrix is positive-definite, for any choice of the specified elements in the 
intervals. This is an LMIP. 
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3.6 Quadratic Approximation of a Polytopic Norm 

Consider a piecewise linear norm || • || p i on R™ denned by 

II II A I T I 

Z p i = max Oj z\ , 

where Oi £ R n for i = 1 , . . . , p. 

For any P > 0, the quadratic norm defined by \\z\\p = v z T Pz = HP 1 ' 2 ^ satisfies 

V\/o|MIp < IMIpi < Va\\z\\ P for all z, 

for some constant a > 1. Thus, the quadratic norm v z T Pz approximates ||z|| p i within 
a factor of a. We will show that the problem of finding P that minimizes a, i.e., the 
problem of determining the optimal quadratic norm approximation of || • || p i, is an 
EVP. 

Let Vi, . . . , vl be the vertices of the unit ball B p i of || • || p i, so that 

B pl = {z | ||z|| p i<l} = Co {vi,..., v L ] 
and let Bp denote the unit ball of || • \\p. Then 

l/M\z\\ P < IMIpi < M\z\\ P (3.7) 

is equivalent to 

1/V^Bp C B p i C y^Bp. 
The first inclusion, 1/^/aBp C B p i, is equivalent to 

Oj P~ 1 a,i < a, i = 1, . . . ,p, 
and the second, i3 p i C y^Sp, is equivalent to 

Uj Pvi < a, i = 1, . . . , L. 
Therefore we can minimize a such that (3.7) holds for some P > by solving the EVP 



minimize a 

P a, 



subject to vf Pvi < a, i = 1, . . . , L, 



T 

af a 



> 0, i = l,. 



Remark: The number of vertices L of the unit ball of || • || p i can grow exponen- 
tially in p and n. Therefore the result is uninteresting from the point of view of 
computational complexity and of limited practical use except for problems with 
low dimensions. 

The optimal factor a in this problem never exceeds \fn. Indeed, by computing 
the maximum volume ellipsoid that lies inside the unit ball of || • || p i, (which does 
not require us to find the vertices Vi, i — 1, . . . , L), we can find (in polynomial- 
time) a norm for which a is less than ^/n. See the Notes and References for more 
details. 
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3.7 Ellipsoidal Approximation 

The problem of approximating some subset of R* with an ellipsoid arises in many 
fields and has a long history; see the Notes and References. To pose such a problem 
precisely we need to know how the subset is described, whether we seek an inner or 
outer approximation, and how the approximation will be judged (volume, major or 
minor semi-axis, etc.). 

In some cases the problem can be cast as a CP or an EVP, and hence solved 
exactly. As an example consider the problem of finding the ellipsoid centered around 
the origin of smallest volume that contains a polytope described by its vertices. We 
saw in §2.2.4 that this problem can be cast as a CP. 

In other cases, we can compute an approximation of the optimal ellipsoid by 
solving a CP or an EVP. In some of these cases, the problem is known to be NP-hard, 
so it is unlikely that the problem can be reduced (polynomially) to an LMI problem. 
It also suggests that the approximations obtained by solving a CP or EVP will not be 
good for all instances of problem data, although the approximations may be good on 
"typical" problems, and hence of some use in practice. 

As an example consider the problem of finding the ellipsoid of smallest volume 
that contains a polytope described by a set of linear inequalities, i.e., { x \ of x < 
bi, i = 1, . . . ,p }. (This is the same problem as described above, but with a different 
description of the polytope.) This problem is NP-hard. Indeed, consider the problem 
of simply verifying that a given, fixed ellipsoid contains a polytope described by a 
set of linear inequalities. This problem is equivalent to the general concave quadratic 
programming problem, which is NP-complete. 

In this section we consider subsets formed from ellipsoids £\ , . . . , £ p in various 
ways: union, intersection, and addition. We approximate these sets by an ellipsoid £q. 

We describe an ellipsoid £ in two different ways. The first description uses convex 
quadratic functions: 

£= {x | T{x) < 0,} , T(x) = x T Ax + 2x T b + c, (3.8) 

where A = A T > and b T A~ 1 b — c > (which ensures that £ is nonempty and does 
not reduce to a single point). Note that this description is homogeneous, i.e., we can 
scale A, b and c by any positive factor without affecting £. 
The volume of £ is given by 

vol(£) 2 = /3 det ({^A^b - c)^" 1 ) 

where /3 is a constant that depends only on the dimension of x, i.e., n. The diameter 
of £ (i.e., twice the maximum semi-axis length) is 



l y /(bTA- l b-c)X max (A- 1 ). 



We will also describe an ellipsoid as the image of the unit ball under an affine 
mapping with symmetric positive-definite matrix: 

£ = { x | (x - x c ) T P~ 2 {x - x c ) < 1 } = { Pz + x c | ||z|| < 1 } , (3.9) 

where P = P T > 0. (This representation is unique, i.e., P and x c are uniquely 
determined by the ellipsoid £.) The volume of £ is then proportional to detP, and 
its diameter is 2A max (P). 

Each of these descriptions of £ is readily converted into the other. Given a repre- 
sentation (3.8) (i.e., A, b, and c) we form the representation (3.9) with 



p= ^b T A^b-cA- x l 2 , x c = -A- 1 b. 
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Given the representation (3.9), we can form a representation (3.8) of £ with 



P- 



-P~ 2 X r 



Ju r , JT JU p 



1. 



(This forms one representation of £; we can form every representation of £ by scaling 
these A, b, c by positive factors.) 

3.7.1 Outer approximation of union of ellipsoids 

We seek a small ellipsoid £q that covers the union of ellipsoids £\, . . . ,£ p (or equiva- 
lently, the convex hull of the union, i.e., Co |Jf=i £»)• We will describe these ellipsoids 
via the associated quadratic functions Ti(x) = x T AiX + 2x T 6 i + Cj. We have 



£ o 2 \J £i 



(3.10) 



if and only if £i C £ for i = 1, . . . ,p. This is true if and only if, for each i, every x 
such that Ti(x) < satisfies Tq(x) < 0. By the 5-procedure, this is true if and only if 
there exist nonnegative scalars T\ , . . . , r p such that 

for every £, T Q (x) - TjTj(cc) < 0, i = l,...,p, 

or, equivalently, such that 



A b 
bo c 



^ 4 6i 
bT a 



<0, 



1. 



. ,p. 



Since our representation of £q is homogeneous, we will now normalize Aq, bo and Cq 
in a convenient way: such that b^A^ 6q — c = 1. In other words we set 



c = b^A % - 1 
and parametrize £ by Aq and b alone. Thus our condition becomes: 

A 



(3.11) 



bo 
bl blA^b 



1 



A % b { 
bT a 



<o, 



1. 



,]7. 



Using Schur complements, we obtain the equivalent LMI 



bl 





bl 
-A 



A t h 
bj c, 




<o, 



l,...,p, 



(3.12) 



with variables Ao, &o, and n, . . . , r p . To summarize, the condition (3.10) is equivalent 
to the existence of nonnegative ti,...,t p such that (3.12) holds. (Since Ao > 0, it 
follows that the Tj must, in fact, be positive.) 



With the normalization (3.11), the volume of £o is proportional to \j det A Q . 
Thus we can find the smallest volume ellipsoid containing the union of ellipsoids £\, 
. . . , £ p by solving the CP (with variables Aq, bo, and n, . . . , t p ) 

minimize log det Aq 

subject to ^o > 0, Ti > 0,...,t p > 0, (3.12) 
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This ellipsoid is called the Lowner-John ellipsoid for Uf=i^*j an d satisfies several 
nice properties (e.g., affine invariance, certain bounds on how well it approximates 
Co[j£i); see the Notes and References. 

We can find the ellipsoid (or equivalently, sphere) of smallest diameter that con- 
tains [J£i by minimizing A max (j4^ ) subject to the constraints of the previous CP, 
which is equivalent to solving the EVP 

maximize A 

subject to A >XI, n > 0, ...,t p > 0, (3.12) 



The optimal diameter is 2/vA°Pt, where A opt is an optimal value of the EVP above. 



3.7.2 Outer approximation of the intersection of ellipsoids 

Simply verifying that 
p 
£o2f]£ t (3.13) 

»=i 

holds, given £q, £\, . . . , £ p , is NP-complete, so we do not expect to recast it as one 
of our LMI problems. A fortiori, we do not expect to cast the problem of finding the 
smallest volume ellipsoid covering the intersection of some ellipsoids as a CP. 

We will describe three ways that we can compute suboptimal ellipsoids for this 
problem by solving CPs. The first method uses the <S-procedure to derive an LMI 
that is a sufficient condition for (3.13) to hold. 

Once again we normalize our representation of £ so that (3.11) holds. From the 
5-procedure we obtain the condition: there exist positive scalars n, . . . ,r p such that 



<0, 



which can be written as the LMI (in variables Aq, bo, and n, . . . , t p ): 



A bo 

bl blA-^bo - 1 _ 


p 


A t b, 
bf Cl _ 



Ao bo 
bl -1 
bo 




bl 



E- 



Ai 

bf 




bi 
Ci 





< 0. 



(3.14) 



This LMI is sufficient but not necessary for (3.13) to hold, i.e., it characterizes some 
but not all of the ellipsoids that cover the intersection of £\, . . . , £ p . We can find the 
best such outer approximation (i.e., the one of smallest volume) by solving the CP 
(with variables A , b , and r 1; . . . , t p ) 



minimize log det A 

subject to Ao > 0, t\ > 0, . 



> Tp > 0, 



(3.14) 



(3.15) 



Remark: The intersection |"] P =1 £% is empty or a single point if and only if there 
exist nonnegative n , 



. ,r p , not all zero, such that 



£- 



Ai h 
bf a 



> 0. 



In this case (3.15) is unbounded below. 
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Another method that can be used to find a suboptimal solution for the problem of 
determining the minimum volume ellipsoid that contains the intersection of £\ , . . . , £ v 
is to first compute the maximum volume ellipsoid that is contained in the intersection, 
which can be cast as a CP (see the next section). If this ellipsoid is scaled by a factor 
of n about it center then it is guaranteed to contain the intersection. See the Notes 
and References for more discussion. 

Yet another method for producing a suboptimal solution is based on the idea of 
the analytic center (see §2.4.1). Suppose that £\, . . . , £ v are given in the form (3.9), 
with cci, . . . , x p and Pi 

F(x) = diag 



, P p . The LMI 



{x - XifPi 



Pi l {x- Xl ) 



1 

/ P~\x-x p ) 



>o 



in the variable x has as feasible set the interior of HiLi £*■ We assume now that the 
intersection nf=i £» nas nonempty interior (i.e., is nonempty and not a single point). 
Let x* denote its analytic center, i.e., 

x* = arg min logdet F(x)~ 
F(x) > 

and let H denote the Hessian of logdet-F(x) -1 at x*. Then it can be shown that 

v 
£ ± [ x | ( x - x *) T H{x - x*) < 1} C p| St. 

i=i 

In fact we can give a bound on how poorly £q approximates the intersection. It can 
be shown that 

p 
{x | (x - x*) T H{x - x*) < (3p+l) 2 } D f]£ t . 

i=l 

Thus, when £q is enlarged by a factor of 3p + 1, it covers nf=i ^i- ^ ee the Notes and 
References for more discussion of this. 



3.7.3 Inner approximation of the intersection of ellipsoids 

The ellipsoid £q is contained in the intersection of the ellipsoids £ i , . . . , £ p if and only 
if for every x satisfying 

(x - x ) T P ~ 2 (x - x ) < 1, 

we have 



(x - x l ) T P i 2 (x - Xi) < 1, i = 1, 



,p. 



Using the <S-procedure, this is equivalent to the existence of nonnegative Ai, . . . , X p 



satisfying 

for every x, (x — Xi) T P~ (x — xi) 



-X i (x-x ) I P (x-x )<l-Aj, i = l,...,p. 
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We prove in the Notes and References section that (3.16) is equivalent to 



-Pf x % - x P 

fa - x ) T Aj - 1 
Po -\J 



<0, 



(3.17) 



Therefore, we obtain the largest volume ellipsoid contained in the intersection of 
the ellipsoids £\, . . . , £ p by solving the CP (with variables P , x a , A 1; . . . , A p ) 

minimize log det Pq 

subject to P > 0, Ai > 0, . . . , A p > 0, (3.17) 



Among the ellipsoids contained in the intersection of £\ , 
that maximizes the smallest semi-axis by solving 



, £ p , we can find one 



minimize A 

subject to XI > P > 0, Ai > 0, . . . , A p > 0, (3.17) 

Equivalently, this yields the sphere of largest diameter contained in the intersection. 

3.7.4 Outer approximation of the sum of ellipsoids 

The sum of the ellipsoids £\ , . . . , £ p is defined as 



A 



A = {xi h + x p | xi e £\, 



, U,p 



e£ P .} 



We seek a small ellipsoid £q containing A. 

The ellipsoid £q contains A if and only if for every x\, 
i = 1, . . . ,p, we have 



, x p such that Ti(xi) < 0, 



T ]>> 4 <0. 



By application of the 5-procedure, this condition is true if 

there exist r, > 0, i = 1, . . . ,p such that 

/ p \ p 

for every x t , i = l,...,p, T Q I ^ x t J- ^ nT^Xi) < 0. 



(3.18) 



Once again we normalize our representation of £q so that (3.11) holds. Let x 
denote the vector made by stacking Xi, . . . , x p , i.e., 



Xl 



Then both T (X}?=i x i) an( ^ 5Z^f=i T iTi(xi) are quadratic functions of x. Indeed, let 
Ei denote the matrix such that Xi = EiX, and define 



E = 2_^ Eii ^ = E i AiEi 



Ei h, i = 0,...,p. 
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With this notation, we see that condition (3.18) is equivalent to 
there exist Tj > 0, i = 1, . . . ,p such that 



A 



hi blA^b, 



1 



i=\ 



A, 

IT 



< 0. 



This condition is readily written as an LMI in variables A$, bo, Ti, . . 
complements: 



A 6 



% 



-1 




bl 



b a -A a 



p 

?=i 



A, 6, 
bj c t 




< 0. 



(3.19) 



, t p using Schur 



(3.20) 



Therefore, we compute the minimum volume ellipsoid, proven to contain the 
sum of £\, . . . ,£ p via the <S-procedure, by minimizing logdet^o" subject to A > 
and (3.20). This is a CP. 

Notes and References 

Matrix scaling problems 

The problem of scaling a matrix to reduce its condition number is considered in Forsythe 
and Straus [FS55] (who describe conditions for the scaling T to be optimal in terms of a 
new variable R — j 1 -*^- 1 ^ j us t as we do). Other references on the problem of improving 
the condition number of a matrix via diagonal or block-diagonal scaling are [Bau63, GV74, 
Sha85, Wat91]. We should point out that the results of §3.1 are not practically useful in 
numerical analysis, since the cost of computing the optimal scalings (via the GEVP) exceeds 
the cost of solving the problem for which the scalings are required (e.g., solving a set of linear 
equations). 

A special case of the problem of minimizing the condition number of a positive-definite matrix 
is considered in [EHM92, EHM93]. Finally, Khachiyan and Kalantari ([KK92]) consider the 
problem of diagonally scaling a positive semidefinite matrix via interior-point methods and 
give a complete complexity analysis for the problem. 

The problem of similarity-scaling a matrix to minimize its norm appears often in control 
applications (see for example [Doy82, Saf82]). A recent survey article on this topic is by 
Packard and Doyle [PD93]. A related quantity is the one-sided multivariable stability margin, 
described in [TSC92]; the problem of computing this quantity can be easily transformed 
into a GEVP. The quantity (3.5), used in the analysis of linear systems with parameter 
uncertainties, was introduced by Fan, Tits and Doyle in [FTD91] (see also [BFBE92]). Some 
closely related quantities were considered in [SC93, SL93A, CS92b]; see also Chapter 8 in 
this book, where a different approach to the same problem and its extensions is considered. 
We also mention some other related quantities found in the articles by Kouvaritakis and 
Latchman [KL85A, KL85B, LN92], and Rotea and Prasanth [RP93, RP94], which can be 
computed via GEVPs. 

Diagonal quadratic Lyapunov functions 

The problem of rescaling a matrix M with a diagonal matrix D > such that its Her- 
mitian part is positive-definite is considered in the articles by Barker et al. [BBP78] and 
Khalil [Kha82]. The problem is the same as determining the existence of diagonal quadratic 
Lyapunov function for a given linear system; see [Ger85, Hu87, BY89, Kra91]. In [KB93], 
Kaszkurewicz and Bhaya give an excellent survey on the topic of diagonal Lyapunov func- 
tions, including many references and examples from mathematical ecology, power systems, 
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and digital filter stability. Diagonal Lyapunov functions arise in the analysis of positive 
orthant stability; see §10.3. 



Matrix completion problems 

Matrix completions problems are addressed in the article of Dym and Gohberg [DG81], in 
which the authors examine conditions for the existence of completions of band matrices such 
that the inverse is also a band matrix. In [GJSW84], their results are extended to a more 
general class of partially specified matrices. In both papers the "sparsity pattern" result 
mentioned in §3.5 appears. In [BJL89], Barrett et al. provide an algorithm for finding the 
completion maximizing the determinant of the completed matrix (i.e., the analytic center A* 
mentioned in §3.5). In [HW93], Helton and Woerdeman consider the problem of minimum 
norm extension of Hankel matrices. In [GFS93], Grigoriadis, Frazho and Skelton consider 
the problem of approximating a given symmetric matrix by a Toeplitz matrix and propose a 
solution to this problem via convex optimization. See also [OSA93]. 

Dancis ([Dan92, Dan93] and Johnson and Lunquist [JL92]) study completion problems in 
which the rank or inertia of the completed matrix is specified. As far as we know, such 
problems cannot be cast as LMIPs. 

A classic paper on the contractive completion problem (i.e., completing a matrix so its norm 
is less than one) is Davis, Kahan, and Weinberger in [DKW82], in which an analytic solution 
is given for contractive completion problems with a special block form. The result in this 
paper is closely related to (and also more general than) the matrix elimination procedure 
given in §2.6.2. See [Woe90, NW92, BW92] for more references on contractive completion 
problems. 



Maximum volume ellipsoid contained in a symmetric polytope 

In this section, we establish some properties of the maximum volume ellipsoid contained in 
a symmetric polytope described as: 

V = I z a ; z < 1 , i — 1 , . . . , p I . 

Consider an ellipsoid described by £ — {x x T Q~ 1 x < 1} where Q = Q T > 0. Its volume is 
proportional to ^/detQ. The condition £ C V can be expressed a t Qai < 1 for i — 1, . . . ,p. 
Hence we obtain the maximum volume ellipsoid contained in V by solving the CP 

minimize log det Q~ 

subject to Q > 0, aj Qai < 1, % = 1, . . . ,p 

Suppose now that Q is optimal. From the standard optimality conditions, there are nonneg- 
ative Ai , . . . , A„ such that 



Aj x 

=i 



^-^ a\ U>ai 



and Ai = if aj Qai = 1. Multiplying by Q 1 ' 2 on the left and right and taking the trace 

yields 5^f=i A * ~ n - 

For any x e R n , we have 



-*=5> 



aj Qai 
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For each i, either \i — or afQat — 1, so we conclude 

p 

£ Q~ X — \ \i(x CLi) ■ 

i = l 

Suppose now that x belongs to V '. Then 

p 

X Q~ X < \ A; 



V 

71. 

« = 1 



Equivalently, V C ^fn£ , so we have: 

i.e., the maximum volume ellipsoid contained in V approximates V within the scale factor 
y/n. 

Ellipsoidal approximation 

The problem of computing the minimum volume ellipsoid in R" that contains a given convex 
set was considered by John and Lowner, who showed that the optimal ellipsoid, when shrunk 
about its center by a factor of n, is contained in the given set (see [Joh85, GLS88]). Such 
ellipsoids are called Lowner-John ellipsoids. In the case of symmetric sets, the factor n can 
be improved to \fn (using an argument similar to the one in the Notes and References section 
above). A closely related problem is finding the maximum volume ellipsoid contained in a 
convex set. Similar results hold for these ellipsoids. 

Special techniques have been developed to compute smallest spheres and ellipsoids containing 
a given set of points in spaces of low dimension; see e.g., Preparata and Shamos [PS85, P. 255] 
or Post [POS84]. 

Minimum volume ellipsoids containing a given set arise often in control theory. Schweppe 
and Schlaepfer [Sch68, SS72] use ellipsoids containing the sum and the intersection of two 
given ellipsoids in the problem of state estimation with norm-bounded disturbances. See also 
the articles by Bertsekas and Rhodes [BR71], Chernousko [Che80a, Che80b, Che80c] and 
Kahan [Kah68]. In [RB92], ellipsoidal approximations of robot linkages and workspace are 
used to rapidly detect or avoid collisions. 

Minimum volume ellipsoids have been widely used in system identification; see the ar- 
ticles by Fogel [FOG79], Fogel and Huang [FH82], Deller [Del89], Belforte, Bona and 
Cerone [BBC90], Lau, Kosut and Boyd [LKB90, KLB90, KLB92], Cheung, Yurkovich 
and Passino [CYP93] and the book by Norton [NOR.86]. 

In the ellipsoid algorithm we encountered the minimum volume ellipsoid that contains a given 
half-ellipsoid; the references on the ellipsoid algorithm describe various extensions. 

Efficient interior-point methods for computing minimum volume outer and maximum volume 
inner ellipsoids for various sets are described in Nesterov and Nemirovskii [NN94]. See 
also [KT93]. 

Ellipsoidal approximation via analytic centers 

Ellipsoidal approximations via barrier functions arise in the theory of interior-point algo- 
rithms, and can be traced back to Dikin [Dik67] and Karmarkar [Kar84] for polytopes 
described by a set of linear inequalities. For the case of more general LMIs, see Nesterov and 
Nemirovskii [NN94], Boyd and El Ghaoui [BE93], and Jarre [Jar93c]. 

These approximations have the property that they are easy to compute, and come with 
provable bounds on how suboptimal they are. These bounds have the following form: if 
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the inner ellipsoid is stretched about its center by some factor a (which depends only on 
the dimension of the space, the type and size of the constraints) then the resulting ellipsoid 
covers the feasible set. The papers cited above give different values of a; one may be better 
than another, depending on the number and type of constraints. 

In contrast, the maximum volume inner ellipsoid, i.e., the Lowner-John ellipsoid, is more dif- 
ficult to compute (still polynomial, however) but the factor a depends only on the dimension 
of the space and not on the type or number of constraints, and is always smaller than the 
ones obtained via barrier functions. So an ellipsoidal approximation from a barrier function 
can serve as a "cheap substitute" for the Lowner-John ellipsoid. 



Proof of lemma in §3.7.3 

We prove that the statement 

for every x, (x — xi) Pi (x — xi) — X(x — xq) P q ~ (x — xq) < 1 — A 
where Po, Pi are positive matrices, A > 0, is equivalent to the matrix inequality 



(3.21) 



-Pi Xi - x Po 

{xi-x ) T A-l 

Po -A/ 



< 0. 



(3.22) 



The maximum over x of 

(x — Xi) Pj - (x — Xi) — \(x — Xq) P ~ (x — Xq) (3.23) 

is finite if and only if Pq — \Pl < (which implies that A > 0) and there exists x* satisfying 
P^ 2 (x* - xi) = \Po 2 (x* - xq), (3.24) 

or, equivalently, satisfying 

(P 2 /A - Pt)P^\x* - xi) =xi- x . (3.25) 

In this case, x* is a maximizer of (3.23). Using (3.24), condition (3.21) implies 

(a;* - xi) T Pr 2 (P? - Pq/\)Pi 2 (x* - xi) < 1 - A. 
Using Schur complements, this last condition is equivalent to 



A-i (x*- Xl ) T Pr 2 (PS/x-p?) 

(P$/\-P?)P- 2 (x*-xi) PS/X-P? 



< 0. 



Using (3.25), this implies 



A-l (Xl - Xq) 1 



Xl ~ Xq P 2 /A - Pi 



< o, 



which is equivalent to (3.22). Conversely, it is easy to show that (3.22) implies (3.21) 
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Chapter 4 

Linear Differential Inclusions 



4.1 Differential Inclusions 

A differential inclusion (DI) is described by: 

xeF{x{t),t), x{0) = x o , (4.1) 

where F is a set-valued function on R™ x R + . Any x : R+ — > R" that satisfies (4.1) is 
called a solution or trajectory of the DI (4.1). Of course, there can be many solutions 
of the DI (4.1). Our goal is to establish that various properties are satisfied by all 
solutions of a given DI. For example, we might show that every trajectory of a given 
DI converges to zero as t — > oo. 

By a standard result called the Relaxation Theorem, we may as well assume 
F(x, t) is a convex set for every x and t. The DI given by 

xe Co F(x(t),t), x{0) = x o (4.2) 

is called the relaxed version of the DI (4.1). Since Co F(x(t),t) 3 F[x{t),i), every 
trajectory of the DI (4.1) is also a trajectory of relaxed DI (4.2). Very roughly speak- 
ing, the Relaxation Theorem states that for many purposes the converse is true. (See 
the References for precise statements.) As a specific and simple example, it can be 
shown that for every DI we encounter in this book, the reachable or attainable sets of 
the DI and its relaxed version coincide, i.e., for every T > 

{x{T) | x satisfies (4.1)} = {x(T) \ x satisfies (4.2)} . 

In fact we will not need the Relaxation Theorem, or rather, we will always get it 
"for free" — every result we establish in the next two chapters extends immediately to 
the relaxed version of the problem. The reason is that when a quadratic Lyapunov 
function is used to establish some property for the DI (4.1), then the same Lyapunov 
function establishes the property for the relaxed DI (4.2). 

4.1.1 Linear differential inclusions 

A linear differential inclusion (LDI) is given by 

x £ fix, x(0) = xq, (4.3) 

where SI is a subset of R nx ™. We can interpret the LDI (4.3) as describing a family 
of linear time-varying systems. Every trajectory of the LDI satisfies 

x = A(t)x, x(0) = x , (4.4) 
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for some A : R + — > CI. Conversely, for any A : R + — » 0, the solution of (4.4) is a 
trajectory of the LDI (4.3). In the language of control theory the LDI (4.3) might be 
described as an "uncertain time- varying linear system," with the set Cl describing the 
"uncertainty" in the matrix A(t). 



4.1.2 A generalization to systems 

We will encounter a generalization of the LDI described above to linear systems with 
inputs and outputs. We will consider a system described by 

x = A(t)x+ B u (t)u + B w (t)w, x(0) = xq, 
z = C z {t)x + D zu {t)u + D zw {t)w 

where x : R + — > R™, u : R + — > R™", w : R + — > R"", z : R+ — » R™* . x is referred 
to as the state, u is the control input, w is the exogenous input and z is the output. 
The matrices in (4.5) satisfy 



A(t) B u (t) B w (t) 
C z (t) D zu (t) D zw (t) 



e n. (4.6) 



for all t > 0, where CI C R( n +«-) x (™+«"+™-). We will be more specific about the form 
of Cl shortly. 

In some applications we can have one or more of the integers n u , n w , and n z equal 
to zero, which means that the corresponding variable is not used. For example, the 
LDI x € Clx results when n u = n w = n z = 0. In order not to introduce another term 
to describe the set of all solutions of (4.5) and (4.6), we will call (4.5) and (4.6) a 
system described by LDIs or simply, an LDI. 



4.2 Some Specific LDIs 

We now describe some specific families of LDIs that we will encounter in the next two 
chapters. 



4.2.1 Linear time-invariant systems 

When O is a singleton, the LDI reduces to the linear time-invariant (LTI) system 

x = Ax + B u u + B w w, a;(0) = Xo, 
z = C z x + D zu u + D zw w, 



where 



A B u B v 



(4.7) 



Although most of the results of Chapters 5-7 are well-known for LTI systems, some 
are new; we will discuss these in detail when we encounter them. 
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4.2.2 Polytopic LDIs 

When SI is a polytope, we will call the LDI a polytopic LDI or PLDI. Most of our 
results require that f2 be described by a list of its vertices, i.e., in the form 



Co 






B Wt i 
D Z w,i 



Al B u x B w ^l 
C z ,l D zu x D zw ^ 



(4.8) 



where the matrices (4.8) are given. 

If instead SI is described by a set of I linear inequalities, then the number of 
vertices, i.e., L, will generally increase very rapidly (exponentially) with I. Therefore 
results for PLDIs that require the description (4.8) are of limited interest for problems 
in which SI is described in terms of linear inequalities. 



4.2.3 Norm-bound LDIs 

Another special class of LDIs is the class of norm-bound LDIs (NLDIs), described by 



x = Ax + B p p + B u u + B w w, x(0) 
q = C q x + D qp p + D qu u + D qw w, 



xo, 



z = C z x + D zp p + D zu u + D zw 



w. 



(4.9) 



p=A(t)q, 



|A(i)||<l. 



where A : R + -> R™^™^ with ||A(i)|| < 1 for all t. We will often rewrite the 
condition p = A(t)q, ||A|| < 1 in the equivalent form p T p < q T q. 
For the NLDI (4.9) the set £7 has the form 



Q 



{A+BA{I-D qp A) 



C 



IAIK1 



where 



A B u 



B w 

U Z w 



B 



B n 



D 



zp 



C 



^q Uqu Uqw 



.(4.10) 



Thus, O is the image of the (matrix norm) unit ball under a (matrix) linear-fractional 
mapping. Note that Vt is convex. The set Vt is well defined if and only if Dj D qp < I, 
in which case we say that the NLDI is well-posed. In the sequel we will consider only 
well-posed NLDIs. 

The equations (4.9) can be interpreted as an LTI system with inputs u, w, output 
Z, and a time- varying feedback matrix A(t) connected between q and p. 



Remark: In the sequel, we often drop some terms in (4.9): D qw and D zp will 
always be zero, and very often D qp , D zw , D zu and D qu will be assumed zero. 
These assumptions greatly simplify the formulas and treatment. It is not hard to 
work out the corresponding formulas and results for the more general case. 
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4.2.4 Diagonal norm-bound LDIs 

A useful variation on the NLDI is the diagonal norm-bound LDI (DNLDI), in which 
we further restrict the matrix A to be diagonal. It is given by 



x = Ax + B p p + B u u + B w w, x(0) = xo, 

q = C q x + D qp p + D qu u + D qw w, 

z = C z x + D zp p + D zu u + D zw w, 

Pi = Si(i)qi, \Si(i)\<l, i = l,...,n q 



(4.11) 



We will often express the condition p, = 5i(t)qi, \5i(t)\ < 1 in the simpler form 
\Pi\ 5= \li\- Here we have the componentwise inequalities \pi\ < \qi\ instead of the single 
inequality ||p|| < \\q\\ appearing in the NLDI. In the language of control theory, we 
describe a DNLDI as a linear system with n q scalar, uncertain, time-varying feedback 
gains, each of which is bounded by one. The DNLDI is described by the set 



n 



{A + BA(I-D gp A) X C ||A|| < 1, A diagonal | , (4.12) 



where A, B and C are given by (4.10). 

The condition of well-posedness for a DNLDI is that I — D qp A should be invertible 
for all diagonal A with ||A|| < 1. This is equivalent to det(7 — D qp A) > when A 
takes on its 2 nq extreme values, i.e., \Au\ = 1. Although this condition is explicit, the 
number of inequalities grows exponentially with n q . In fact, determining whether a 
DNLDI is well-posed is NP-hard; see the Notes and References. 

It turns out that for a DNLDI, Co fHs a polytope. In fact, 

Cofl = CoiA + BA{I - DqpAy 1 C A diagonal, \A H \ = l\ , 

i.e., the vertices of Coil are among 2™' images of the extreme points of A under the 
matrix linear-fractional mapping. Therefore a DNLDI can be described as a PLDI, at 
least in principle. But there is a very important difference between the two descriptions 
of the same LDI: the number of vertices required to give the PLDI representation of 
an LDI increases exponentially with n q (see the Notes and References of Chapter 5 
for more discussion). 

Remark: More elaborate variations on the NLDI are sometimes useful. Doyle 
introduced the idea of "structured feedback," in which the matrix A is restricted 
to have a given block-diagonal form, and in addition, there can be equality con- 
straints among some of the blocks. When we have a single block, we have an 
NLDI; when it is diagonal (with no equality constraints), we have a DNLDI. 
Most of the results we present for NLDIs and DNLDIs can be extended to these 
more general types of LDIs. 



4.3 Nonlinear System Analysis via LDIs 

Much of our motivation for studying LDIs comes from the fact that we can use them 
to establish various properties of nonlinear, time-varying systems using a technique 
known as "global linearization". The idea is implicit in the work on absolute stability 
originating in the Soviet Union in the 1940s. 
Consider the system 

x = f(x,u,w,t), z = g(x,u,w,t). (4-13) 
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Suppose that for each x, u, w, and t there is a matrix G(x, u, w,t) G O such that 



f(x,u,w,t) 
g(x,u,w,t) 



G{x,u, w, t) 



x 
u 
w 



(4.14) 



where f2 C j^("+ n ^) x ("+"«+"">) Then of course every trajectory of the nonlinear 
system (4.13) is also a trajectory of the LDI defined by £7. If we can prove that every 
trajectory of the LDI defined by £1 has some property (e.g., converges to zero), then 
a fortiori we have proved that every trajectory of the nonlinear system (4.13) has this 
property. 

4.3.1 A derivative condition 

Conditions that guarantee the existence of such a G are /(0, 0, 0, 0) = 0, g(0, 0, 0, 0) = 
0, and 



G f2 for all x, u, w, t. 



(4.15) 



df_ df_ df_ 

dx du dw 

dg dg dg 

dx du dw 

In fact we can make a stronger statement that links the difference between a pair 
of trajectories of the nonlinear system and the trajectories of the LDI given by £7. 
Suppose we have (4.15) but not necessarily /(0, 0, 0, 0) = 0, g(0, 0, 0, 0) = 0. Then for 
any pair of trajectories (x, w, z) and {x, w, z) we have 









X — X 


x — 


X 


eCo(] 


u — u 


z — 


z 












w — w 



i.e., (x — x, w — w, z — z) is a trajectory of the LDI given by Co f2. 

These results follow from a simple extension of the mean-value theorem: if 
R" -> R™ satisfies 



dx 



g n 



(4.16) 



throughout R™, then for any x and x we have 

4>(x) — 4>(x) G Co£l(a; — x). 
To see this, let c G R™. By the mean- value theorem we have 

c T ^(x)-^x)) = c T ^(C)(x-x) 
for some ^ that lies on the line segment between x and x. Since by assumption 

dx 
we conclude that 

c (4>(x) — 4>(x)) < sup c A(x — x). 

AeCoQ, 

Since c was arbitrary, we see that 4>(x) — 4>(x) is in every half-space that contains the 
convex set Cof2(a; — x), which proves (4.16). 



-(C) en, 
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4.3.2 Sector conditions 

Sometimes the nonlinear system (4.13) can be expressed in the form 

x = Ax + B p p + B u u + B w w, 

q= C q x + D qp p + D qu u+ D qw w, (4 171 

z = C z x + D zp p + D zu u + D zw w, 

Pi = <f>i{qi,t), i = l,...,n q , 

where fa : R x R + — > R satisfy the sector conditions 

a t q 2 < qfa(q,t) < (3 t q 2 

for all q and t > 0, where ct{ and /3, are given. In words, the system consists of a linear 
part together with n q time-varying sector-bounded scalar nonlinearities. 

The variables pi and qi can be eliminated from (4.17) provided the matrix I—D qp A 
is nonsingular for all diagonal A with ckj < An < Pi- In this case, we say the system 
is well-posed. 

Assume now that the system (4.17) is well-posed, and let (4.13) be the equations 
obtained by eliminating the variables p and q. Define 

n= {A+BAil-DgpAj^C | A diagonal, A u e [(*,&]}, 
where A, B and C are given by (4.10). Then, the condition (4.14) holds. 

Notes and References 

Differential inclusions 

The books by Aubin and Cellina [AC 84], Kisielewicz [Kis91], and the classic text by Filip- 
pov [Fil88] cover the theory of differential inclusions. See also the article by Roxin [Rox65] 
and references therein. Background material, e.g., set-valued analysis, can be found in the 
book by Aubin and Frankowska [AF90]. 

The term linear differential inclusion is a bit misleading, since LDIs do not enjoy any particu- 
lar linearity or superposition properties. We use the term only to point out the interpretation 
as an uncertain time-varying linear system. Many authors refer to LDIs using some subset 
of the phrase "uncertain linear time- varying system" . A more accurate term, suggested to 
us and used by A. Filippov and E. Pyatnitskii, is selector-linear differential inclusion. 

PLDIs come up in several articles on control theory, e.g., [MP89, KP87A, KP87B, BY89]. 

Integral quadratic constraints 

The pointwise quadratic constraint p(t) T p(t) < q(t) T q(t) that occur in NLDIs can be gener- 
alized to the integral quadratic constraint J p T p dr < J q T q dr; see §8.2 and §8.3. 

Global linearization 

The idea of replacing a nonlinear system by a time-varying linear system can be found in Liu 
et al. [Liu68, LSL69]. They call this approach global linearization. Of course, approximating 
the set of trajectories of a nonlinear system via LDIs can be very conservative, i.e., there are 
many trajectories of the LDI that are not trajectories of the nonlinear system. We will see 
in Chapter 8 how this conservatism can be reduced in some special cases. 

The idea of global linearization is implicit in the early Soviet literature on the absolute 
stability problem, e.g., Lur'e and Postnikov [LP44, Lur57] and Popov [Pop73]. 
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Modeling systems as PLDIs 

One of the key issues in robust control theory is how to model or measure plant "uncertainty" 
or "variation". We propose a simple method that should work well in some cases. At the 
least, the method is extremely simple and natural. 

Suppose we have a real system that is fairly well modeled as a linear system. We collect 
many sets of input/output measurements, obtained at different times, under different op- 
erating conditions, or perhaps from different instances of the system (e.g., different units 
from a manufacturing run). It is important that we have data sets from enough plants or 
plant conditions to characterize or at least give some idea of the plant variation that can be 
expected. 

For each data set we develop a linear system model of the plant. To simplify the problem 
we will assume that the state in this model is accessible, so the different models refer to the 
same state vector. These models should be fairly close, but of course not exactly the same. 
We might find for example that the transfer matrices of these models at s — differ by about 
10%, but at high frequencies they differ considerably more. More importantly, this collection 
of models contains information about the "structure" of the plant variation. 

We propose to model the system as a PLDI with the vertices given by the measured or 
estimated linear system models. In other words, we model the plant as a time-varying linear 
system, with system matrices that can jump around among any of the models we estimated. 
It seems reasonable to conjecture that a controller that works well with this PLDI is likely 
to work well when connected to the real system. Such controllers can be designed by the 
methods described in Chapter 7. 

Well-posedness of DNLDIs 

See [BY89] for a proof that a DNLDI is well-posed if and only if det(7 — D qp A) > for every 
A with \Aa\ — 1; the idea behind this proof can be traced back to Zadeh and Desoer [ZD63, 
§9.17]; see also [Sae86]. An equivalent condition is in terms of Vo matrices (a matrix is Vo 
if every principal minor is nonnegative; see [FP62, FP66, FP67]): A DNLDI is well-posed if 
and only if I + D qp is invertible, and {I + D qp )~ l (I — D qp ) is a Vo matrix [Gha90]. Another 
equivalent condition is that (7 + D qp ,I — D qp ) is a Wo pair (see [Wil70]). The problem 
of determining whether a general matrix is Vo is thought to be very difficult, i.e., of high 
complexity [CPS92, pl49]. 

Standard branch-and-bound techniques can be applied to the problem of determining well- 
posedness of DNLDIs; see for example [Sd86, GS88, SP89, BBB91, BB92]. Of course these 
methods have no theoretical advantage over simply checking that the 2 Uq determinants are 
positive. But in practice they may be more efficient, especially when used with computation- 
ally cheap sufficient conditions (see below). 

There are several sufficient conditions for well-posedness of a DNLDI that can be checked in 
polynomial-time. Fan, Tits, and Doyle [FTD91] show that if the LMI 

D qp PD qp <P, P diagonal, P > 0, (4.18) 

is feasible, then the DNLDI is well-posed. Condition (4.18) is equivalent to the existence of a 
diagonal (scaling) matrix R such that ||7?7J 9 p7? _1 || < 1. In [FTD91], the authors also give a 
sufficient condition for well-posedness of more general "structured" LDIs, e.g., when equality 
constraints are imposed on the matrix A in (4.11). These conditions can also be obtained 
using the 5-procedure; see [Fer93]. 

Representing DNLDIs as PLDIs 

For a well-posed DNLDI, Cof2 is a polytope, which means that the DNLDI (4.12) can also 
be represented as a PLDI. More specifically, we have 

Cofi = Co{i + BA(7-7J q pA)" 1 C' | A diagonal, |A«| = l} , 
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where A, B and C are given by (4.10). Note that the set on the right-hand side is a polytope 
with at most 2" 9 vertices. We now prove this result. 

Define 

K= {A + BA(I- DgpA)- 1 C | A diagonal, |A is | = l} , 
which is a set containing at most 2 nq points. By definition 

CoO = Co {A + BA(I- DqpA)'^ 1 | A diagonal, |A«| < l} . 

Clearly we have CoO D ColC. We will now show that every extreme point of CoO is 
contained in Co fC, which will imply that Co fC D CoO, completing the proof. 

Let Wext be an extreme point of CoO. Then there exists a linear function ip such that W ex t 
is the unique maximizer of %j) over CoO. In fact, W ex t £ O, since otherwise, the half-space 
{W | ip(W) < V'(Wext) — e}, where e > is sufficiently small, contains O but not Wext, which 
contradicts W e xt G CoO. 

Now, for W £ O, ^>(W) is a bounded linear-fractional function of the first diagonal entry 
An of A, for fixed A22, • • • , A„ qrlq , i.e., it equals (a + 6An)/(c + dAn) for some a, 6, c and 
d. Moreover, the denominator c + dAn is nonzero for An £ [—1,1], since the DNLDI is 
well-posed. Therefore, for every fixed A22, ■ • • , A Uqnq , the function ip achieves a maximum at 
an extreme value of An, i.e., An — ±1. Extending this argument to A22, • ■ ■ , A nqTlq leads 
us to the following conclusion: W e xt — A + BS (I — D qp S)~ C, where S satisfies Sit — ±1. 
In other words, W ex t G Co/C, which concludes our proof. 

We have already noted that the description of a DNLDI as a PLDI will be much larger than 
its description as a DNLDI; see the Notes and References of Chapter 5, and also [BY89] and 
[PS82, KP87A, Kam83]. 

Approximating PLDIs by NLDIs 

It is often useful to conservatively approximate a PLDI as an NLDI. One reason, among 
many, is the potentially much smaller size of the description as an NLDI compared to the 
description of the PLDI. We consider the PLDI 

x = A(t)x, A{t) € fipLDi = Co{A u ...,A L }, (4.19) 

with x(t) £ R n , and the NLDI 

x= (A + B p A{t)C q )x, ||A(i)|| < 1, (4.20) 

with associated set Unldi — {A + B p AC q \ ||A|| < 1}. Our goal is to find A, B p , and C q 
such that fipLDi C ^nldi with the set Onldi as small as possible in some appropriate sense. 
This will give an efficient outer approximation of the PLDI (4.19) by the NLDI (4.20). Since 
Opldi C Onldi implies that every trajectory of the PLDI is a trajectory of the NLDI, every 
result we establish for all trajectories of the NLDI holds for all trajectories of the PLDI. For 
L very large, it is much easier to work with the NLDI than the PLDI. 

For simplicity, we will only consider the case with B p £ R, nxn anc { invertible. Evidently there 
is substantial redundancy in our representation of the NLDI (4.20): We can replace B p by 
aB p and C q by C q /a where a / 0. (We can also replace B p and C q with B P U and VC q 
where U and V are any orthogonal matrices, without affecting the set Onldi, but we will 
not use this fact.) 

We have Opldi C Onldi if for every £ and k — 1, . . . , L, there exists it such that 

B p 7T = (A k - A)£, 7T T 7V < fCfC q S,. 
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This yields the equivalent condition 

(A k - A) T B; T B;\A k -A)< CjC q , k = 1.....L, 
which, in turn, is equivalent to 



B p B T p > 0, 



C T q C q (A k - Af 
A k — A B n B„ 



> 0, fc=l,...,L. 



Introducing new variables V — C q C q and W = B p B p , we get the equivalent LMI in A, V 
and W 



W>0, 



V (A fc - A) 3 

A k - A W 



> 0, fe = l,...,L. 



(4.21) 



Thus we have ^pldi C JInldi if condition (4.21) holds. 

There are several ways to minimize the size of Onldi 3 ^pldi- The most obvious is to 
minimize Tr V + Tr W subject to (4.21), which is an EVP in A, V and W. This objective 
is clearly related to several measures of the size of Onldi, but we do not know of any simple 
interpretation. 

We can formulate the problem of minimizing the diameter of Onldi as an EVP. We define the 
diameter of a set O C R nx ™ a s max{||F — G\\ | F, G £ O}. The diameter of Onldi is equal 
to 2y'X mllx (V)\ m n X (W). We can exploit the scaling redundancy in C q to assume without loss 
of generality that A max (V r ) < 1, and then minimize A max (W A ) subject to (4.21) and V < I, 
which is an EVP. It can be shown that the resulting optimal V opt satisfies A max (V opt ) = 1, 
so that we have in fact minimized the diameter of Onldi subject to Opldi C Onldi- 
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Chapter 5 

Analysis of LDIs: 
State Properties 



In this chapter we consider properties of the state x of the LDI 

x = A(t)x, A{t)efl, (5.1) 

where 57 C R™ " has one of four forms: 

• LTI systems: LTI systems are described by x = Ax. 

• Polytopic LDIs: PLDIs are described by x = A(t)x, A(t) e Co {A\,. . . , A L }. 

• Norm-bound LDIs: NLDIs are described by 

x = Ax + B pP , q=C q x + D qpP , p=A{t)q, ||A(i)|| < 1, 
which we will rewrite as 

x = Ax + B p p, p T p < (C q x + D qp p) T (C q x + D qp p). (5.2) 

We assume well-posedness, i.e., ||-D gp || < 1. 

• Diagonal Norm-bound LDIs: DNLDIs are described by 

x = Ax + B p p, q = C q x + D qp p, 

Pi = Si(i)qi, \Si(i)\<l, i = l,...,n q . 

which can be rewritten as 

x = Ax + B p p, q=C q x + D qp p, |p»| < |?»|, i = l,...,n q . 

Again, we assume well-posedness. 

5.1 Quadratic Stability 

We first study stability of the LDI (5.1), that is, we ask whether all trajectories of 
system (5.1) converge to zero as t — > oo. A sufficient condition for this is the existence 
of a quadratic function V(£) = £ T P£,, P > that decreases along every nonzero 
trajectory of (5.1). If there exists such a P, we say the LDI (5.1) is quadratically 
stable and we call V a quadratic Lyapunov function. 
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Since 

d 
d~t 



V(x) = x T (A(t) T P + PA{t)) x, 



a necessary and sufficient condition for quadratic stability of system (5.1) is 

P > 0, A T P + PA < for all A £ ft. (5.4) 

Multiplying the second inequality in (5.4) on the left and right by P , and defining 
a new variable Q = P^ 1 , we may rewrite (5.4) as 



>0, 



QA 1 + AQ < for all A e ft. 



(5.5) 



This dual inequality is an equivalent condition for quadratic stability. We now show 
that conditions for quadratic stability for LTI systems, PLDIs, and NLDIs can be 
expressed in terms of LMIs. 



• LTI SYSTEMS: Condition (5.4) becomes 
P > 0, A T P+ PA<0. 



(5.6) 



Therefore, checking quadratic stability for an LTI system is an LMIP in the variable 
P. This is precisely the (necessary and sufficient) Lyapunov stability criterion for LTI 
systems. (In other words, a linear system is stable if and only if it is quadratically sta- 
ble.) Alternatively, using (5.5), stability of LTI systems is equivalent to the existence 
of Q satisfying the LMI 



>0, 



AQ + QA T < 0. 



(5.7) 



Of course, each of these (very special) LMIPs can be solved analytically by solving a 
Lyapunov equation (see §1.2 and §2.7). 



Polytopic LDIs: Condition (5.4) is equivalent to 



P>0, 



A\ P+PAi< 0, 



1,...,L. 



(5.8) 



Thus, determining quadratic stability for PLDIs is an LMIP in the variable P. The 
dual condition (5.5) is equivalent to the LMI in the variable Q 



>0, 



QAf + AiQ < 0, 



1, 



L. 



(5.9) 



• Norm-bound LDIs: Condition (5.4) is equivalent to P > and 

T 






A 1 P+ PA PB 



p 



B T P 



for all nonzero £ satisfying 






-C q Cq 







r 1 r> 






<0 



~D qp C q I - D qp D qp 



7T 



< 0. 



(5.10) 



(5.11) 



In order to apply the <S-procedure we show that the set 

A={(H,ir) 1^0, (5.11)} 
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equals the set 

B = {(^tt) |(£,7r)^0, (5.11)}. 

It suffices to show that {(£, n) \ £ = 0, ir ^ 0, (5.11) } = 0. But this is immediate: 
If 7r ^ 0, then condition (5.11) cannot hold without having £ 7^ 0, since I—DTD qp > 0. 
Therefore, the condition that dV(x)/dt < for all nonzero trajectories is equivalent 
to (5.10) being satisfied for any nonzero (£, 7r) satisfying (5.11). (This argument recurs 
throughout this chapter and will not be repeated.) Using the <S-procedure, we find 
that quadratic stability of (5.2) is equivalent to the existence of P and A satisfying 



P>0, 



A>0, 



A T P +PA+ XCjCg 



PB n 



^C q Dq p 



(PB p 



^Cg Dqp) 



-X(I - DLD, 



qp-^qp! 



(5.12) 



<0. 



Thus, determining quadratic stability of an NLDI is an LMIP. Since LMI (5.12) implies 
that A > 0, we may, by defining P = P/X, obtain an equivalent condition 



P>0, 



A T P + PA+ C^Cg PB p + C^Dgp 

(PBp + C<TD q p) T -{I-D T qp D qp ) 



<0, 



(5.13) 



an LMI in the variable P. Thus, quadratic stability of the NLDI has the frequency- 
domain interpretation that the H^ norm of the LTI system 



Ax + B p p, 



1 



CqX- 



DqpP 



is less than one. 

With the new variables Q = P^ 1 , /i = 1/A, quadratic stability of NLDIs is also 
equivalent to the existence of [i and Q satisfying the LMI 



/i>0, Q > 0, 

AQ + QA T + iiBpB T v [iB v D T qv 



{nBpD T qp 



QC, 



T\T 



QC T q 



-n(i 



P'qpL'qp) 



(5.14) 



<0. 



Remark: Note that our assumption of well-posedness is in fact incorporated in 
the LMI (5.12) since it implies / - D qp D^ p > 0. Therefore the NLDI (5.2) is 
well-posed and quadratically stable if and only if the LMIP (5.12) has a solution. 

In the remainder of this chapter we will assume that D qp in (5.2) is zero; the 
reader should bear in mind that all the following results hold for nonzero D qp as well. 

• Diagonal Norm-bound LDIs: For the DNLDI (5.3), we can obtain only a 
sufficient condition for quadratic stability. The condition dV(x)/dt < for all nonzero 
trajectories is equivalent to 

i T {A T P + PA)^ + ifPBpit < 

for all nonzero (£, 71") satisfying 

TTj < (Cg,i£ + D qp,iK) {Cq,i£, + D qPyl Tr) , i = l,...,L, 
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where we have used C q ^ and D qp ^ to denote the ith rows of C q and D qp respec- 
tively. Using the <S-procedure, we see that this condition is implied by the existence 
of nonnegative Ai, . . . , \ n such that 

Z T {A T P+PA)Z + 2ti T PB p ir 

L 

+ Y, X * (( C i4 + Dqp.^f {C q 4 + D qp jir) - Tif) < 

i=l 

for all nonzero (£, ir). With A = diag(A, . . . , A„ ), this is equivalent to 

< 0. (5.15) 



A T P +PA+ C T q AC q PB P + C^AD qp 



B^P + D T qv AC q DT p AD q p - A 

Therefore, if there exist P > and diagonal A > satisfying (5.15), then the 
DNLDI (5.3) is quadratically stable. Checking this sufficient condition for quadratic 
stability is an LMIP. Note that from (5.15), we must have A > 0. 

Equivalently, quadratic stability is implied by the existence of Q > 0, M = 
diag(/ii, . . . , n n ) > satisfying the LMI 



AQ + QA T + BpMBl QC? + B v MD T qv 
C q Q + D qv MB T v -M + D qp MDT p 



< 0, (5.16) 



another LMIP. 



Remark: Quadratic stability of a DNLDI via the 5-procedure has a simple 
frequency-domain interpretation. Denoting by H the transfer matrix H(s) — 
D qp + C' q (sl — A)~ B p , and assuming (A, B p ,C q ) is minimal, quadratic stabil- 
ity is equivalent to the fact that, for some diagonal, positive-definite matrix A, 
||A 1 ' 2 _ffA~ 1 ' 2 || O0 < 1. A -1 ' 2 can then be interpreted as a scaling. (See the Notes 
and References for more details on the connection between the 5-procedure and 
scaling.) 



Remark: Here too our assumption of well-posedness is in fact incorporated 
in the LMI (5.15) or (5.16). The bottom right block is precisely the sufficient 
condition for well-posedness mentioned on page 57. Therefore the DNLDI (5.2) 
is well-posed and quadratically stable if the LMI (5.15) or (5.16) is feasible. 

Note the similarity between the corresponding LMIs for the NLDI and the DNLDI; 
the only difference is that the diagonal matrix appearing in the LMI associated with 
the DNLDI is fixed as the scaled identity matrix in the LMI associated with the NLDI. 
In general it is straightforward to derive the corresponding (sufficient) condition for 
DNLDIs from the result for NLDIs. In the remainder of the book, we will often restrict 
our attention to NLDIs, and pay only occasional attention to DNLDIs; we leave to 
the reader the task of generalizing all the results for NLDIs to DNLDIs. 

Remark: An LTI system is stable if and only if it is quadratically stable. For 
more general LDIs, however, stability does not imply quadratic stability (see the 
Notes and References at the end of this chapter). It is true that an LDI is 
stable if and only if there exists a convex Lyapunov function that proves it; see 
the Notes and References. 
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5.1.1 Coordinate transformations 

We can give another interpretation of quadratic stability in terms of state-space coor- 
dinate transformations. Consider the change of coordinates x = Tx, where detT ^ 0. 
In the new coordinate system, (5.1) is described by the matrix A(t) = T~ l A{t)T . 
We ask the question: Does there exist T such that in the new coordinate system, all 
(nonzero) trajectories of the LDI are always decreasing in norm, i.e., d||ai||/di < 0? 
It is easy to show that this is true if and only if there exists a quadratic Lyapunov 
function V(£) = £, T P£, for the LDI, in which case we can take any T with TT T = P _1 , 
for example, T = P^ 1 ' 2 (these T's are all related by right orthogonal matrix multi- 
plication). With this interpretation, it is natural to seek a coordinate transformation 
matrix T that makes all nonzero trajectories decreasing in norm, and has the small- 
est possible condition number k(T). This turns out to be an EVP for LTI systems, 
polytopic and norm-bound LDIs. 

To see this, let P = p- Tr p- 1 ^ so that k(T) 2 = k(P). Minimizing the condition 
number of T subject to the requirement that d||S||/dt < for all nonzero trajectories is 
then equivalent to minimizing the condition number of P subject to d(x T Px)/dt < 0. 
(Any T with TT T = P~ t where P opt is an optimal solution, is optimal for the original 
problem.) For each of our systems (LTI systems, PLDIs and NLDIs), the change of 
coordinates with smallest condition number is obtained by solving the following EVPs: 

• LTI SYSTEMS: The EVP in the variables P and r\ is 

minimize rj 

subject to (5.6), I < P < r)I 

(In this formulation, we have taken advantage of the homogeneity of the LMI (5.6) in 
the variable P.) 

• Polytopic LDIs: The EVP in the variables P and rj is 

minimize rj 

subject to (5.8), I < P < rjl 

• Norm-bound LDIs: The EVP in the variables P, r\ and A is 

minimize rj 

subject to (5.12), I < P < r]I 

5.1.2 Quadratic stability margins 

Quadratic stability margins give a measure of how much the set i7 can be expanded 
about some center with the LDI remaining quadratically stable. 

• Polytopic LDIs: For the PLDI 

x = {A + A(t))x, A(t) e a Co {A x , ... , A L } , 

we define the quadratic stability margin as the largest nonnegative a for which it is 
quadratically stable. This quantity is computed by solving the following GEVP in P 
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and a: 

maximize a 

subject to P > 0, a > 0, 

A%P + PA + a (Af P + PAi) < 0, i = l,...,L 

• Norm-bound LDIs: The quadratic stability margin of the system 

x = Ax + B p p, p p < o?x C C q x, 

is defined as the largest a > for which the system is quadratically stable, and is 
computed by solving the GEVP in P, X and (3 = a 2 : 



maximize (3 

\ A T P + PA + (3XCTC„ PB P 
subject to P > 0, /3 > 0, T ' q q p 

B^P -XI 

Defining P = P/X, we get an equivalent EVP in P and (3: 
maximize f3 



<0 



subject to P > 0, P > 0, 



A T P + PA + f3C?C q PB p 
BfP -I 



< 



Remark: The quadratic stability margin obtained by solving this EVP is just 
l/||C 9 (s/-,4)- 1 .B p || 00 . 



5.1.3 Decay rate 

The decay rate (or largest Lyapunov exponent) of the LDI (5.1) is defined to be the 
largest a such that 

lim e a *||:r(£)|| = 

t — ►OO 

holds for all trajectories x. Equivalently, the decay rate is the supremum of 

hminf^MM 

t^oo t 

over all nonzero trajectories. (Stability corresponds to positive decay rate.) 

We can use the quadratic Lyapunov function V^(£) = £ P£ to establish a lower 
bound on the decay rate of the LDI (5.1). If dV{x)/dt < —2aV(x) for all trajectories, 
then V(x(t)) < V ' {x{<d))er 2at , so that \\x{t)\\ < e~ Qt K(P) 1 / 2 ||a;(0)|| for all trajectories, 
and therefore the decay rate of the LDI (5.1) is at least a. 

• LTI SYSTEMS: The condition that dV{x)/dt < -2aV(x) for all trajectories is 
equivalent to 

A T P + PA + 2aP < 0. (5.17) 

Therefore, the largest lower bound on the decay rate that we can find using a quadratic 
Lyapunov function can be found by solving the following GEVP in P and a: 

maximize a 

5.18 
subject to P > 0, (5.17) 
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This lower bound is sharp, i.e., the optimal value of the GEVP (5.18) is the decay rate 
of the LTI system (which is the stability degree of A, i.e., negative of the maximum 
real part of the eigenvalues of A). 

As an alternate condition, there exists a quadratic Lyapunov function proving 
that the decay rate is at least a if and only if there exists Q satisfying the LMI 



>0, 



AQ + QA T + 2aQ < 0. 



(5.19) 



• Polytopic LDIS: The condition that dV{x)/dt < — 2aV(x) for all trajectories is 
equivalent to the LMI 



Aj P + PAi + 2aP < 0, 



1. 



,L. 



(5.20) 



Therefore, the largest lower bound on the decay rate provable via quadratic Lyapunov 
functions is obtained by maximizing a subject to (5.20) and P > 0. This is a GEVP 
in P and a. 

As an alternate condition, there exists a quadratic Lyapunov function proving 
that the decay rate is at least a if and only if there exists Q satisfying the LMI 



>0, 



AiQ + QA\ + 2aQ < 0, i = l,...,L. 



(5.21) 



• Norm-bound LDIs: Applying the 5-procedure, the condition that dV(x)/dt < 
— 2aV(x) for all trajectories is equivalent to the existence of A > such that 



A T P +PA+ \CjC q + 2aP PB p 
B^P -XI 



< 0. 



(5.22) 



Therefore, we obtain the largest lower bound on the decay rate of (5.2) by maximizing 
a over the variables a, P and A, subject to P > 0, A > and (5.22), a GEVP. 

The lower bound has a simple frequency-domain interpretation: Define the a- 
shifted Hqo norm of the system by 



\H\ 



sup{ \\H(s)\\ | Res > -a} . 



Then the optimal value of the GEVP is equal to the largest a such that \\H ||oo.a < 1- 
This can be seen by rewriting (5.22) as 



{A + alf P+P{A+aI) + \C?C q PB p 



B T P 



-XI 



<o, 



and noting that the Hqo norm of the system (A + al, B p , C q ) equals ||i?||oo,a- 

An alternate necessary and sufficient condition for the existence of a quadratic 
Lyapunov function proving that the decay rate of (5.2) is at least a is that there 
exists Q and [i satisfying the LMI 



Q > 0, M>0, 

AQ + QA T + fiBpBf + 2aQ QC^ 
C q Q -ill 



(5.23) 



< 0. 
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5.2 Invariant Ellipsoids 

Quadratic stability can also be interpreted in terms of invariant ellipsoids. For Q > 0, 
let £ denote the ellipsoid centered at the origin 

£={£eR" |£ T Q^<1}. 

The ellipsoid £ is said to be invariant for the LDI (5.1) if for every trajectory x of the 
LDI, x(to) G £ implies x(t) G £ for all t > to- It is easily shown that this is the case 
if and only if Q satisfies 

QA T + AQ < 0, for all A e n, 

or equivalently, 

A T P + PA<0, for all Aefl, (5.24) 

where P = Q^ 1 . Thus, for LTI systems, PLDIs and NLDIs, invariance of £ is charac- 
terized by LMIs in Q or P, its inverse. 

Remark: Condition (5.24) is just the nonstrict version of condition (5.4). 



• LTI SYSTEMS: The corresponding LMI in P is 

P > 0, A T P + PA<0, 
and the LMI in Q is 

Q > 0, AQ + PA T < 0, 



(5.25) 
(5.26) 



• POLYTOPIC LDIS: The corresponding LMI in P is 



P > 0, AfP+ PAi < 0, 
and the LMI in Q is 

<3>0, QAf + AiQ<0, 



1,...,L, 



(5.27) 



(5.28) 



• Norm-bound LDIs: Applying the <S-procedure, invariance of the ellipsoid £ is 
equivalent to the existence of A such that 



P>0, 



A>0, 



A T P +PA+ \C T q C q PB P + XCjDgp 
(PB p + XC^D qp ) T -\(I-D^ p D qp ) 



(5.29) 



< 0. 



Invariance of the ellipsoid £ is equivalent to the existence of /z such that 
H > 0, Q > 0, and 



AQ + QA T + \xB p Bl LiB p DT p + QC? 
{^B p D T qp + QC T q ) T -n{I - D qp D T qp ) 



(5.30) 



< 0. 



In summary, the condition that £ be an invariant ellipsoid can be expressed in 
each case as an LMI in Q or its inverse P. 
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5.2.1 Smallest invariant ellipsoid containing a polytope 

Consider a polytope described by its vertices, V = Co{i>i, . . . , v p }. The ellipsoid £ 
contains the polytope V if and only if 

vjQ^Vj < 1, j = I,..., p. 
This condition may be expressed as an LMI in Q 

T 

>0, j = l,...,p, (5.31) 



1 v] 



Vj Q 

or as an LMI in P = Q^ 1 as 
vjP Vj 

Minimum volume 



vjP Vj <l, j = l,..., p. (5.32) 



The volume of £ is, up to a constant that depends on n, -y/det Q. We can minimize 
this volume by solving appropriate CPs. 

• LTI systems: The CP in P is 

minimize log det P~ 1 
subject to (5.25), (5.32) 

• Polytopic LDIS: The CP in P is 

minimize logdetP -1 

& (5.33) 

subject to (5.27), (5.32) 

• Norm-bound LDIs: The CP in P and A is 

minimize log det P~ 1 
subject to (5.29), (5.32) 

Minimum diameter 

The diameter of the ellipsoid £ is 2y / A max (Q)- We can minimize this quantity by 
solving appropriate EVPs. 

• LTI systems: The EVP in the variables Q and A is 

minimize A 

subject to (5.26), (5.31), Q < XI 

• Polytopic LDIs: The EVP in Q and A is 

minimize A 

(5.34) 
subject to (5.28), (5.31), Q < XI 

• Norm-bound LDIs: The EVP in Q, X and /i is 

minimize A 

subject to (5.30), (5.31), Q < XI 
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Remark: These results have the following use. The polytope V represents our 
knowledge of the state at time to, i.e., x(to) £ V (this knowledge may reflect 
measurements or prior assumptions). An invariant ellipsoid £ containing V then 
gives a bound on the state for t > to, i.e., we can guarantee that x(t) £ £ for all 
t> t . 

5.2.2 Largest invariant ellipsoid contained in a polytope 

We now consider a polytope described by linear inequalities: 

v={£en n \aU<h k = l,...,q}. 
The ellipsoid £ is contained in the polytope V if and only if 

max {a^i | £ € £} < 1, k = 1, . . . ,q. 

This is equivalent to 

a k Qa k < 1, k = 1, . . . , q, (5.35) 

which is a set of linear inequalities in Q. The maximum volume of invariant ellipsoids 
contained in V can be found by solving CPs: 

• LTI systems: For LTI systems, the CP in the variable Q is 

minimize log det Q ~ 1 
subject to (5.26), (5.35) 

(Since the volume of £ is, up to a constant, -y/det Q, minimizing log det Q -1 will 
maximize the volume of £). 

• Polytopic LDIs: For PLDIs, the CP in the variable Q is 

minimize log det Q ~ 1 
subject to (5.28), (5.35) 

• Norm-bound LDIs: For NLDIs, the CP in the variables Q and \i is 

minimize log det Q ~ 1 
subject to (5.30), (5.35) 

We also can find the maximum minor diameter (that is, the length of the minor 
axis) of invariant ellipsoids contained in V by solving EVPs: 

• LTI systems: For LTI systems, the EVP in the variables Q and A is 

maximize A 

subject to (5.26), (5.35), XI < Q 

• Polytopic LDIs: For PLDIs, the EVP in the variables Q and A is 

maximize A 

subject to (5.28), 5.35), XI < Q 
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• Norm-bound LDIs: For NLDIs, the EVP in the variables Q, fi and A is 

maximize A 

subject to (5.30), (5.35), XI < Q 

Remark: These results can be used as follows. The polytope V represents the 
allowable (or safe) operating region for the system. The ellipsoids found above 
can be interpreted as regions of safe initial conditions, i.e., initial conditions for 
which we can guarantee that the state always remains in the safe operating region. 

5.2.3 Bound on return time 

The return time of a stable LDI for the polytope V is defined as the smallest T such 
that if x(0) G V , then x(t) G V for all t > T. Upper bounds on the return time can 
be found by solving EVPs. 

• LTI SYSTEMS: Let us consider a positive decay rate a. If Q > satisfies 

QA T + AQ + 2aQ < 0, 

then £ is an invariant ellipsoid, and moreover x(0) G £ implies that x(t) G e~ at £ for 
all t > 0. Therefore if T is such that 

e~ aT £ C V C £, 

we can conclude that if x(0) G V, then x(t) G V for all t > T, so that T is an upper 
bound on the return time. If we use both representations of the polytope, 

V = {x G R" | a T k x < 1, k = l,...,q} = Co{vi,...,v p }, 

the constraint e~ aT £ C V is equivalent to the LMI in Q and 7 

af Qai < 7, fc=l,...,g, (5.36) 

where 7 = e 2aT , and the problem of finding the smallest such 7 and therefore the 
smallest T (for a fixed a) is an EVP in the variables 7 and Q: 

minimize 7 

subject to (5.19), (5.31), (5.36) 

• Polytopic LDIs: For PLDIs, the smallest bound on the return time provable via 
invariant ellipsoids, with a given decay rate a, is obtained by solving the EVP in the 
variables Q and 7 

minimize 7 

subject to (5.21), (5.31), (5.36) 

• Norm-bound LDIs: For NLDIs, the smallest bound on the return time provable 
via invariant ellipsoids, with a given decay rate a, is obtained by solving the EVP in 
the variables Q, 7 and [i 

minimize 7 

subject to (5.23), (5.31), (5.36) 
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Notes and References 

Quadratic stability for NLDIs 

Lur'e and Postnikov [LP44] gave one of the earliest stability analyses for NLDIs: They 
considered the stability of the system 

x — Ax + bpP, q — c q x, p — (f>(q,t)q, (5.37) 

where p and q are scalar. This problem came to be popularly known as the problem of 
"absolute stability in automatic control" . In the original setting of Lur'e and Postnikov, 4> was 
assumed to be a time-invariant nonlinearity. Subsequently, various additional assumptions 
were made about <f>, generating different special cases and solutions. (The family of systems of 
the form (5.37), where 4>{q,t) can be any sector [—1, 1] nonlinearity, is an NLDI.) Pyatnitskii, 
in [Pya68], points out that by 1968, over 200 papers had been written about the system (5.37). 

Among these, the ones most relevant to this book are undoubtedly from Yakubovich. As 
far as we know, Yakubovich was the first to make systematic use of LMIs along with the 
5-procedure to prove stability of nonlinear control systems (see references [Yak62, Yak64, 
Yak67, Yak66, Yak77, Yak82]). The main idea of Yakubovich was to express the re- 
sulting LMIs as frequency-domain criteria for the transfer function G(s) = c q (sl — A)~ 1 b p . 
(Yakubovich calls this method the "method of matrix inequalities".) These criteria are most 
useful when dealing with experimental data arising from frequency response measurements; 
they are described in detail in the book by Narendra and Taylor [NT73]. See also [Bar70b]. 
Popov [Pop73] and Willems [Wil71b, Wil74a] outline the relationship between the problem 
of absolute stability of automatic control and quadratic optimal control. 

The case when p and q are vector-valued signals has been considered much more recently; 
see for instance, [ZK88, ZK87, KR88, BH89]. In [KPZ90, PD90], the authors remark 
that quadratic stability of NLDIs is equivalent to the H-o condition (5.13). However, as 
stated in [PD90], there is no fundamental difference between these and the older results 
of Yakubovich and Popov; the newer results can be regarded as extensions of the works 
of Yakubovich and Popov to feedback synthesis and to the case of "structured perturba- 
tions" [RCDP93, PZP+92]. 

While all the results presented in this chapter are based on Lyapunov stability techniques, 
they have very strong connections with the input-output approach to study the stability 
of nonlinear, uncertain systems. This approach was sparked by Popov [Pop62], who used 
it to study the Lur'e system. Other major contributors include Zames [Zam66a, Zam66b, 
ZF67], Sandberg [San64, San65a, San65b] and Brockett [Bro65, Bro66, BW65]. See 
also the papers and the book by Willems [Wil69a, Wil69b, Wil71a]. The advantage of 
the input-output approach is that the systems are not required to have finite-dimensional 
state [Des65, DV75, LOG90]; the corresponding stability criteria are usually most easily 
expressed in the frequency domain, and result in infinite- dimensional convex optimization 
problems. Modern approaches from this viewpoint include Doyle's /x-analysis [Doy82] and 
Safonov's _R" m -analysis [Saf82, SD83, SD84], implemented in the robustness analysis soft- 
ware packages p-tools and _ftT m -tools [BDG + 91, CS92a], which approximately solve these 
infinite-dimensional problems. See also [AS79, SS81]. 

Quadratic stability for PLDIs 

This topic is much more recent than NLDIs, the likely reason being that the LMI expressing 
quadratic stability of a general PLDI (which is just a number of simultaneous Lyapunov 
inequalities (5.8)) cannot be converted to a Riccati inequality or a frequency-domain criterion. 
One of the first occurrences of PLDIs is in [HB76], where Horisberger and Belanger note 
that the problem of quadratic stability for a PLDI is convex (the authors write down the 
corresponding LMIs). For other results and computational procedures for PLDIs see [PS82, 
PS86, BY89, KP87A, Kam83, EZKN90, KB93, AGG94]; the article [KP87a], where the 
authors develop a subgradient method for proving quadratic stability of a PLDI, which is 
further refined in [Mei91]; the paper [PS86], where the discrete-time counterpart of this 
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problem is considered. See also the articles by Gu et al. [GZL90, GL93B, Gu94], Garofalo 
et al. [GCG93], and the survey article by Barmish et al. [BK93]. 

Necessary and sufficient conditions for LDI stability 

Much attention has also focused on necessary and sufficient conditions for stability of LDIs 
(as opposed to quadratic stability); see for example [PR91b] or [MP86]. Earlier references on 
this topic include [Pya70b] which connects the problem of stability of an LDI to an optimal 
control problem, and [Pya70a, BT79, BT80], where the discrete-time counterpart of this 
problem is considered. 

Vector Lyapunov functions 

The technique of vector Lyapunov functions also yields search problems that can be expressed 
as LMIPs. One example is given in [CFA90]. Here we already have quadratic Lyapunov 
functions for a number of subsystems; the problem is to find an appropriate positive linear 
combination that proves, e.g., stability of an interconnected system. 

Finally, let us point out that quadratic Lyapunov functions have been used to determine 
estimates of regions of stability for general nonlinear systems. See for example [Gha94, 
BD85]. 

Stable LDIs that are not quadratically stable 

An LTI system is stable if and only if it is quadratically stable; this is just Lyapunov's 
stability theorem for linear systems (see e.g., [Lya47, p277]). It is possible, however, for an 
LDI to be stable without being quadratically stable. Here is a PLDI example: 

x = A(t)x, A(t) e Co {A u A 2 }, 



Ai = 



-100 




A,= 



120 



-9 

-18 



From (2.8), this PLDI is not quadratically stable if there exist Qo > 0, Q\ > and Qi > 0, 
not all zero, such that 

Qo = AiQi + QiAi + A 2 Q 2 + Q2A2. 
It can be verified that the matrices 



Q 



5.2 2 
2 24 



0.1 3 
3 90 





2.7 


1 


, Q 2 = 








1 


1 



satisfy these duality conditions. 

However, the piecewise quadratic Lyapunov function 



V(x) — max \x T Pix, x T ?2x\ 
Pi = 



14 -1 
-1 1 











, Pi = 











1 



(5.38) 



proves that the PLDI is stable. To show this, we use the 5-procedure. A necessary and 
sufficient condition for the Lyapunov function V defined in (5.38) to prove the stability of 
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the PLDI is the existence of four nonnegative numbers Ai, A2, A3, A4 such that 

Af Pi + P 1 A 1 - Ai(P 2 - Pi) < 0, 

A^P 1 +P 1 A 2 - A 2 (P 2 -Pi) <0, (5.39) 

Aj P 2 + P 2 A X + A 3 (P 2 - Pi) < 0, A T 2 P 2 + P 2 A 2 + A 4 (P 2 - Pi) < 0. 

It can be verified that Ai = 50, A 2 = 0, A3 = 1, A4 = 100 are such numbers. 

For other examples of stable LDIs that are not quadratically stable, see Brockett [Bro65, 
Bro77, Bro70] and Pyatnitskii [Pya71]. In [Bro77], Brockett uses the S-procedure to 
prove that a certain piecewise quadratic form is a Lyapunov function and finds LMIs similar 
to (5.39); see also [Vm78]. 

Finally, an LDI is stable if and only if there is a convex Lyapunov function that proves it. 
One such Lyapunov function is 

V(£) = sup{||ar(t)|| I x satisfies (5.1), x(0) = £, t> 0}. 

See Brayton and Tong [BT79, BT80], and also [Zub55, MV63, DK71, VV85, PR91B, 
PR91A, Rap90, Rap93, MP89, Mol87]. In general, computing such Lyapunov functions is 
computationally intensive, if not intractable. 

Nonlinear systems and fading memory 

Consider the nonlinear system 

x = f{x,w,t), (5.40) 

with 

df 

— '— G Q for all x, w, t 

OX 

where Q is convex. Suppose the LDI x € 0.x is stable, i.e., all trajectories converge to zero as 
t — > 00. This implies the system has fading memory, that is, for fixed input W, the difference 
between any two trajectories of (5.40) converges to zero. In other words, the system "forgets" 
its initial condition. (See [BC85].) 

Fix an input w and let x and x denote any two solutions of (5.40). Using the mean- value 
theorem from §4.3.1, we have 

— {x-x) = f(x,w,t) - f(x,w,t) = A(t)(x- x) 

for some A(t) G fl. Since the LDI is stable, x(t) — x(t) converges to zero as t — > oo. 

Relation between 5-procedure for DNLDIs and scaling 

We discuss the interpretation of the diagonal matrix A in (5.15) as a scaling matrix. 
Every trajectory of the DNLDI (5.3) is also a trajectory (and vice versa) of the DNLDI 

x = Ax + BpT'ip, 

q = TC q x + TD qp T- l p, (5.41) 

Pi = 5i{t)q u \8i(t)\<l, i = l,...,n q 

for any diagonal nonsingular T. Therefore, if DNLDI (5.41) is stable for some diagonal 
nonsingular T, then so is DNLDI (5.3). T is referred to as a scaling matrix. 
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Treating DNLDI (5.41) as an NLDI, and applying the condition (5.13) for quadratic stability, 
we require, for some diagonal nonsingular T, 



P>0, 



A T P + PA + C T q T 2 C q 
(PBpT- 1 + C q r T 2 D qp T~ 1 



PBpT- 1 + C q r T 2 D qp T~ l 
-(I-T- 1 D% p T a D„T- 1 ) _ 



<0, 



An obvious congruence, followed by the substitution T 2 = A, yields the LMI condition in 
P > and diagonal A > 0: 



A T P + PA + C^AC q PB P + C q AD q 



Bj,P + D* p AC q 



Di p AD q - A 



<0. 



This is precisely LMI (5.15), the condition for stability of the DNLDI (5.3) obtained using 
the 5-procedure. This relation between scaling and the diagonal matrices arising from the 
5-procedure is described in Boyd and Yang [BY89]. 

Pyatnitskii and Skorodinskii [PS82, PS83], and Kamenetskii [Kam83] reduce the problem of 
numerical search for appropriate scalings and the associated Lyapunov functions to a convex 
optimization problem. Saeki and Araki [SA88] also conclude convexity of the scaling problem 
and use the properties of M-matrices to obtain a solution. 

Representing a DNLDI as a PLDI: Implications for quadratic stability 

We saw in Chapter 4 that a DNLDI can be represented as a PLDI, where the number L of 
vertices can be as large as 2 nq . From the results of §5.1, checking quadratic stability for this 
PLDI requires the solution of L simultaneous Lyapunov inequalities in P > (LMI (5.8)). If 
the system is represented as a DNLDI, a sufficient condition for quadratic stability is given 
by the LMI (5.15), in the variables P > and A > 0. 

Comparing the two conditions for quadratic stability, we observe that while the quadratic 
stability condition (5.8) for LDI (5.3), obtained by representing it as a PLDI, is both necessary 
and sufficient (i.e., not conservative), the size of the LMI grows exponentially with n q , the 
size of A. On the other hand, the size of the LMI (5.15) grows polynomially with n q , but 
this LMI yields only a sufficient condition for quadratic stability of LDI (5.3). Discussion of 
this issue can be found in Kamenetskii [Kam83] and Pyatnitskii and Skorodinskii [PS82]. 
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Chapter 6 

Analysis of LDIs: 
Input/Output Properties 

6.1 Input-to-State Properties 

We first consider input-to-state properties of the LDI 

x = A(t)x + B w (t)w, \A{t) B„(t)]efl, (6.1) 

where w is an exogenous input signal. 

• LTI SYSTEMS: The system has the form x = Ax + B w w. 

• POLYTOPIC LDIS: Here the description is 

x = A(t)x + B w (t)w, [A(t) B w (t)}€ Co {[At B wA ],...,[A L B W<L ]} . 

• Norm-bound LDIs: NLDIs have the form 

x = Ax + B pP + B w w, q=C q x, p = A(t)q, ||A(i)|| < 1, 
or equivalently, 

x = Ax + B p p + B w w, q = C q x, p p < q q. 

• Diagonal Norm-bound LDIs: DNLDIs have the form 

x = Ax + B p p + B w w, q = C q x, 
Pi = Si(i)qi, \Si(t)\<l, i = l,...,n q , 
or equivalently, 

x = Ax + B p p+ B w w, q = C q x, \Pi\<\qi\, i = 1, . . . ,n q . 

6.1.1 Reachable sets with unit-energy inputs 

Let 7^ ue denote the set of reachable states with unit-energy inputs for the LDI (6.1), 



n - 



x(T) 



x, w satisfy (6.1), x(0) = 

T 

w T wdt<l, T>0 
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We will bound TZ UC by ellipsoids of the form 

where P > 0. 

Suppose that the function V(£) = £ P£, with P > 0, satisfies 

dV(x)/dt < w w for all x, w satisfying (6.1). 

Integrating both sides from to T, we get 

r T 

V(x(T))-V(x(0))< w T wdt. 
Jo 

Noting that V(x(0j) = since x(0) = 0, we get 



(6.2) 



(6.3) 



V{x{T)) < w'wdt^l, 
Jo 

for every T > 0, and every input w such that L w T w dt < 1. In other words, the 
ellipsoid £ contains the reachable set 7^ uc . 

• LTI systems: Condition (6.3) is equivalent to the LMI in P 

A T P + PA PB V 



P>0, 



<0, 



B T W P -I _ 

which can also be expressed as an LMI in Q = P^ 1 
Q > 0, AQ + QA T + B w Bl < 0. 



(6.4) 



(6.5) 



Remark: For a controllable LTI system, the reachable set is the ellipsoid 
{£, | £ Wc £ < 1}, where W c is the controllability Gramian, defined by 



W c = I e At B w Ble Alt dt. 



Since W c satisfies the Lyapunov equation 
AW C + W C A T + B w Bl = 0, 



(6.6) 



we see that Q — W c satisfies (6.5). Thus the ellipsoidal bound is sharp for LTI 
systems. 



• POLYTOPIC LDIS: For PLDIs, condition (6.3) holds if and only if 
A{t) T P+PA{t) PB w {t) 



P > and 

B w (t) T P 

Inequality (6.7) holds if and only if 



P>0, 



AjP+PA t PB Wii 
Bl. t P -I 



< for all t > 0. 



< 0, i = l,...,L. 



(6.7) 



(6.8) 



Thus the ellipsoid £ contains TZ UC if the LMI (6.8) holds. 
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Alternatively, we can write (6.8) as 

Q>0, QAf + A i Q + B Wyi Bl i <0, i = l,...,L, 
where Q = P . Note that this condition implies that £ is invariant. 
• Norm-bound LDIs: Condition (6.3) holds if and only if P > and 

f(A T P + PA)£, + 2£ T P {B p tt + B w uj) - uj t uj < 
hold for every u) and for every £ and 7r satisfying 

7r T 7r - e T c q T c,e < o. 

Using the <S-procedure, this is equivalent to the existence of P and A satisfying 



(6.9) 



P > 0, A>0, 



A T P +PA + ACjC, PB P PB U 



B T P 
B T P 



-XI 









< 0. 



(6.10) 



Equivalently, defining Q = P , condition (6.3) holds if and only if there exist Q and 
H satisfying 



Q > 0, \i > 0, 



AQ + QA T + B w Bl + iiB p BT QC T q 



C q Q 



-ill 



< 0. (6.11) 



• Diagonal Norm-bound LDIs: Condition (6.3) holds if and only if P > and 
for all £ and n that satisfy 



j n q j 



we have for all lo, 

£, T (A T P + PA)£ + 2£ T P {BpTT + B w uj) - uj t uj < 0. 

It follows from the 5-procedure that this condition holds if there exist P and A 
diag(A, . . . , A„ ) satisfying 



P > 0, A > 0, 



A T P + PA+ ClAC„ PB v PB u 



B T P 
BlP 



-A 









< 0. 



Equivalently defining Q = P , condition (6.3) holds if there exist Q and a diagonal 
matrix M such that 



>0, 



M > 0, 



AQ + QA T + B W B T W + B v MBl QC'J 



a, 



M 



< 0. (6.12) 



For the remainder of this section we leave the extension to DNLDIs to the reader. 

The results described above give us a set of ellipsoidal outer approximations of 
TZ UC - We can optimize over this set in several ways. 
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Smallest outer approximations 

• LTI systems: For LTI systems, we can minimize the volume over ellipsoids £ with 
P satisfying (6.3) by minimizing logdet P^ 1 over the variable P subject to (6.4). This 
is a CP. 

If (A, B w ) is uncontrollable, the volume of the reachable set is zero, and this CP 
will be unbounded below. If (A, B w ) is controllable, then the CP finds the exact 
reachable set, which corresponds to P = W~ l . 

• POLYTOPIC LDIs: For PLDIs, the minimum volume ellipsoid of the form (6.2) 
with P satisfying (6.3), is found by minimizing logdet P^ 1 subject to (6.8). This is a 
CP. 

• Norm-bound LDIs: For NLDIs, the minimum volume ellipsoid of the form (6.2), 
satisfying (6.3), is obtained by minimizing logdet P~ l over the variables P and A 
subject to (6.10). This is again a CP. 

We can also minimize the diameter of ellipsoids of the form (6.2), satisfying (6.3), 
by replacing the objective function logdet P~ l in the CPs by the objective function 
A m ax( J P -1 )- This yields EVPs. 

Testing if a point is outside the reachable set 

The point xo lies outside the reachable set if there exists an ellipsoidal outer approx- 
imation of TZ ue that does not contain it. For LTI systems, xq does not belong to the 
reachable set if there exists P satisfying Xq Pxq > 1 and (6.4). This is an LMIP Of 
course, if (A, B w ) is controllable, then xq does not belong to the reachable set if and 
only if XqW ( T 1 xo > 1, where W c is defined by the equation (6.6). 

For PLDIs, a sufficient condition for a point xq to lie outside the reachable set 
lZ ue can be checked via an LMIP in the variable P: XqPxo > 1 and (6.8). For NLDIs, 
Xq lies outside the reachable set 1Z UC if there exist P and A satisfying Xq Px > 1 
and (6.10), another LMIP. 

Testing if the reachable set lies outside a half-space 

The reachable set 7\L uo lies outside the half-space TL = {£ € R" a T £ > 1 } if and only 
if an ellipsoid £ containing 1Z uo lies outside TL, that is, the minimum value of £ T Q _1 £ 
over £ satisfying a 7 £ > 1 exceeds one. We easily see that 

min {£ T Q -1 £ | a T i > 1 } = \/{a T Qa). 

Therefore, for LTI systems, TL does not intersect the reachable set if there exists 
Q satisfying a T Qa < 1 and (6.5). This is an LMIP. If (A, B w ) is controllable, then TL 
does not intersect the reachable set if and only if a T W c a < 1, where W c is defined by 
the equation (6.6). 

For PLDIs, a sufficient condition for TL not to intersect the reachable set lZ ue 
can be checked via an LMIP in the variable Q: a T Qa < 1 and (6.9). For NLDIs, TL 
does not intersect the reachable set 7^ ue if there exists Q and \x satisfying a T Qa < 1 
and (6.11). This is an LMIP. 

Testing if the reachable set is contained in a polytope 

Since a polytope can be expressed as an intersection of half-spaces (these determine 
its "faces" ) , we can immediately use the results of the previous subsection in order to 
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check if the reachable set is contained in a polytope. (Note we only obtain sufficient 
conditions.) We remark that we can use different ellipsoidal outer approximations to 
check different faces. 



6.1.2 Reachable sets with componentwise unit-energy inputs 

We now turn to a variation of the previous problem; we consider the set of reachable 
states with inputs whose components have unit-energy, that is, we consider the set 



n» 



A 



x(T) 



x, w satisfy (6.1), x(0) = 0, 

(■T 

w\ dt < 1, i = 1, . . . ,n w , 



T> 



Suppose there is a positive-definite quadratic function V(£) = £ T P£, and R 
diag(ri, . . . , r nm ) such that 



,R>0, TrR=l, 

V(x) < w Rw for all x and w satisfying (6.1) 



P > 
d 
dt 



(6.13) 



Then the ellipsoid £ given by (6.2) contains the reachable set. To prove this, we 
integrate both sides of the last inequality from to T, to get 

V(x(T))-V{x{0))< [ w T Rwdt. 



Noting that V(x(0j) = since x(0) = 0, we get 

r T 

V(x(T)) < / w T Rw dt < 1. 
Jo 

Let us now consider condition (6.13) for the various LDIs: 

• LTI SYSTEMS: For LTI systems, condition (6.13) is equivalent to the LMI in P 
and R: 



P > 0, R > and diagonal, Tri? = 1, 
A T P + PA PB U 



B T P 



-R 



(6.14) 



< 0. 



Therefore, if there exist P and R such that (6.14) is satisfied, then the ellipsoid £ 
contains the reachable set TZ ncc . 

Alternatively, using the variable Q = P^ 1 , we can write an equivalent LMI in Q 
and R: 



Q > 0, R > and diagonal, TV R = 1, 
QA T + AQ B v 



B, 



-R 



(6.15) 



< 0. 



Remark: It turns out that with LTI systems, the reachable set is equal to the 
intersection of all such ellipsoids. (For more details on this, see the Notes and 
References.) 



This electronic version is for personal use and may not be duplicated or distributed. 



82 



Chapter 6 Analysis of LDIs: Input/Output Properties 



• POLYTOPIC LDIs: For PLDIs, condition (6.13) holds if and only if 
P > 0, R > and diagonal, Tr R = 1, 



Aj P + PA, PB,„ 



B 1 P 



-R 



(6.16) 



<0, 



1. 



Therefore, if there exist P and R such that (6.16) holds, then the ellipsoid £ 
{£ I t T PZ< 1} contains K ucc . 

With Q = P , condition (6.13) is equivalent to 



Q > 0, R > and diagonal, Tr R = 1, 






(6.17) 



<0, 



,L. 



• Norm-bound LDIs: For NLDIs, applying the 5-procedure, condition (6.13) holds 
if and only if there exist P, A and R such that 



P > 0, R > and diagonal, Tri? = 1, A > 0, 



A T P +PA+ XCjC q PB p PB U 



B T P 
B T P 



-XI 





-R 



(6.18) 



< 0. 



Therefore, if there exists P, A and R such that (6.18) is satisfied, then the ellipsoid £ 
contains the reachable set TZ nC c- 

With Q = P^ 1 , condition (6.13) is also equivalent to the existence of Q, R and /i 
such that 



Q > 0, R > and diagonal, Tr R = 1, \x > 0, 



AQ + QA T + fxBpB^ QC% B w 
C q Q -fil 

Bl -R 



(6.19) 



< 0. 



As in the previous section, we can find the minimum volume and minimum di- 
ameter ellipsoids among all ellipsoids of the form (6.2), satisfying condition (6.13), by 
solving CPs and EVPs, respectively. We can also check that a point does not belong 
to the reachable set or that a half-space does not intersect the reachable set, by solving 
appropriate LMIPs. 



6.1.3 Reachable sets with unit-peak inputs 

We consider reachable sets with inputs w that satisfy w T w < 1. Thus, we are inter- 
ested in the set 



n 



up 



x(T) 



x, w satisfy (6.1), x(0) = 0, I 

w T w < 1, T > J ' 

Suppose that there exists a quadratic function V(£) = £, T P£, with 

P > 0, and dV{x)/dt < for all x, w 
satisfying (6.1), w T w < 1 and V(x) > 1. 
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Then, the ellipsoid £ given by (6.2) contains the reachable set 7?. U p. Let us consider 
now condition (6.20) for the various LDIs: 

• LTI systems: For LTI systems, condition (6.20) is equivalent to P > and 

Z T {A T P + PA)£ + £ T PB w w + w T BlPt, < 
for any £ and w satisfying 

w T w < 1 and £ T P£ > 1. 

Using the <S-procedure, we conclude that condition (6.20) holds if there exist a > 
and (3 > such that for all x and w, 

r A T P + PA+aP PB V 



x 
w 



B r P 



-(31 



x 

w 



+ (3 -a < 0, 



or equivalently 






' A T P + PA + aP PB W 







B T W P -(31 










(3 — a 



< 0. 



(6.21) 



(6.22) 



Clearly we must have a > (3. Next, if (6.22) holds for some (ao,/3o), then it holds for 
all ao > (3 > (3q. Therefore, we can assume without loss of generality that (3 = a, and 
rewrite (6.21) as 



A 1 P + PA+aP PB„ 



< 0. 



B T W P -al 

Therefore, if there exists P and a satisfying 
P > 0, a > 0, and (6.23), 



(6.23) 



(6.24) 



then the ellipsoid £ contains the reachable set !Z nv . Note that inequality (6.23) is not 
an LMI in a and P; however, for fixed a it is an LMI in P. 

Alternatively, condition (6.23) is equivalent to the following inequality in Q = P^ 1 
and a: 



QA T + AQ + aQ B w 
Bl -al 



< 0. 



(6.25) 



• POLYTOPIC LDIS: Condition (6.20) holds if P > and there exists a > satisfying 
AfP + PAj + aP PB„ 



-al 



<o, 



B T P 

Therefore, if there exists P and a satisfying 
P > 0, a > 0, and (6.26), 



1. 



L. 



(6.26) 



(6.27) 



then the ellipsoid £ contains the reachable set 7\L up . Once again, we note that condi- 
tion (6.26) is not an LMI in a and P, but is an LMI in P for fixed a. We can also 
rewrite condition (6.26) in terms of Q = P~ x as the equivalent inequality 



QAj + AiQ + aQ B w , % 



B, 



-al 



<0, 



■ L. 



(6.28) 
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• Norm-bound LDIs: For NLDIs, condition (6.20) holds if and only if P > and 

Z T {A T P + PA)£ + 2£ t P{B w uj + B p ir) < 
for every u>, £ and 7r satisfying 



UJ T LO < 1. 



Z T P£ > 1, 



n 1 it - e C\ C q i < 0. 



Therefore, using the <S-procedure, a sufficient condition for (6.20) to hold is that there 
exist a > and A > such that 



" A T P + PA + aP + \C%C q 


PB p 


PB W 


B T P 

p 


-XI 





B T P 





— al 



< 0. 



If there exists P, a and A satisfying 
P > 0, a > 0, A > 0, 



and (6.29), 



(6.29) 



(6.30) 



the ellipsoid £ P -\ contains the reachable set 7\L up . We note again that inequality (6.29) 
is not an LMI in P and a, but is an LMI in P for fixed a. 

Defining Q = P" 1 , condition (6.20) is also implied by the existence of /i > and 
a > such that 



" QA T + AQ + aQ + nB^B p 


QC T q 


B w 


C q Q 


-iii 





B w 





— al 



< 0. 



(6.31) 



(In fact, the existence of A > and a > satisfying (6.29) is equivalent to the existence 
of fi > and a > satisfying (6.31).) 

For fixed a, we can minimize the volume or diameter among all ellipsoids £ satis- 
fying condition (6.24) for LTI systems, condition (6.27) for PLDIs and condition (6.30) 
for NLDIs by forming the appropriate CPs, or EVPs, respectively. 

Remark: Inequality (6.23) can also be derived by combining the results of §6.1.1 
with an exponential time-weighting of the input w. See the Notes and References 
for details. 

As in §6.1.2, it is possible to determine outer ellipsoidal approximations of the 
reachable set of LTI systems, PLDIs and NLDIs, subject to componentwise unit- 
peak inputs. 



6.2 State-to-Output Properties 

We now consider state-to-output properties for the LDI 

x = A(t)x, z = C z (t)x 
where 



A(t) 
C z (t) 



e n. 



(6.32) 



(6.33) 



• LTI systems: LTI systems have the form x = Ax, z = C z x. 
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• Polytopic LDIS: PLDIs have the form x = A(t)x, z = C z (t)x, where 

e Co 



A(t) 

C z (t) 



A! 



A L 

C L 



• Norm-bound LDIs: NLDIs have the form 

x = Ax + B p p, q=C q x, z = C z x p = A(i)q, ||A(t)|| < 1, 
or equivalently 

x = Ax + B p p, q = C q x, z = C z x, p p < q q. 

• Diagonal Norm-bound LDIs: DNLDIs have the form 

x = Ax + B p p, q = C q x, 

z = C z x, pi = Si{t)q t , \Si(i)\<l, i = l,...,n q , 

or equivalently 

x = Ax + B p p, q = C q x, 

z = C z x, \pi\ < \q t \, i=l,...,n q . 

6.2.1 Bounds on output energy 

We seek the maximum output energy given a certain initial state, 



z z dt 



x = A(t)x, z = C z (t)x 



(6.34) 



where x(0) is given, and the maximum is taken over A(t), C z (t) such that (6.33) holds. 
Suppose there exists a quadratic function V(£) = £ T P£ such that 



P > and — V(x) < —z z, for every x and z satisfying (6.32) 
dt 



(6.35) 



Then, integrating both sides of the second inequality in (6.35) from to T, we get 

r T 

V(x(T)) - V(x(0)) <- z T z dt. 
Jo 

for every T > 0. Since V(x(Tj) > 0, we conclude that V(x(0)) = x(0) T Px(0) is an 
upper bound on the maximum energy of the output z given the initial condition x(0). 
We now derive LMIs that provide upper bounds on the output energy for the 
various LDIs. 

• LTI systems: In the case of LTI systems, condition (6.35) is equivalent to 

P>0, A T P + PA + C T Z C Z < 0. (6.36) 

Therefore, we obtain the best upper bound on the output energy provable via quadratic 
functions by solving the EVP 



minimize x(0) T Px(0) 
subject to P > 0, (6.36) 
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In this case the solution can be found analytically, and it is exactly equal to the output 
energy, which is x(0) T W o x(0), where W is the observability Gramian of the system, 
defined by 



W Q 



A L trtT, 



~M 



e- 'CiC^dt. 



Since W satisfies the Lyapunov equation 

A T W + W A + CjC z = 0, 

it satisfies (6.35). 

With Q = P~ 1 , condition (6.35) is equivalent to 



>0, 



AQ + QA T QC T Z 
C Z Q -I 



< 0. 



(6.38) 



(6.39) 



If Q satisfies (6.39), then an upper bound on the output energy (6.34) is x(0) T Q ^(O). 
• Polytopic LDIs: For PLDIs, condition (6.35) is equivalent to 

P > 0, A{t) T P + PA{t) + C z {t) T C z {t) < for all t > 0. (6.40) 

Inequalities (6.40) hold if and only if the following LMI in the variable P holds: 



P>0, 



a; p + PAi 



C z ,jCz,j < o, 



1,. 



L. 



(6.41) 



Therefore the EVP corresponding to finding the best upper bound on the output 
energy provable via quadratic functions is 



minimize x(0) T Px(0) 
subject to P > 0, (6.41) 

Defining Q = P^ 1 , condition (6.35) is also equivalent to the LMI in Q 

AtQ + QAf QCl, 
C z , t Q -I 



(6.42) 



>0, 



< 0. 



• Norm-bound LDIs: For NLDIs, condition (6.35) is equivalent to P > and 

i T {A T P +PA + C T z C z )i + 2£ r PB p ir < 

for every £ and n that satisfy n T ir < £ T CjC q £. Using the <S-procedure, an equivalent 
condition is the existence of P and A such that 

P > 0, A > 0, 

A T P + PA + C T Z C Z + \C^C q PB P 



B T P 

P 



-XI 



(6.43) 



< 0. 



Therefore we obtain the smallest upper bound on the output energy (6.34) provable 
via quadratic functions satisfying condition (6.35) by solving the following EVP in the 
variables P and A 



minimize x(0) T Px(0) 

subject to P > 0, A > 0, (6.43) 
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With Q = P , condition (6.35) is also equivalent to the existence of \x > satisfying 



Q>0, 



AQ + QA T + fiBpB? QC T Z 
C Z Q -I 



C„Q 







QC T q 

o 

-ill 



< 0. 



• Diagonal Norm-bound LDIs: For DNLDIs, condition (6.35) is equivalent to 

Z T {A T P +PA + C T z C z )i + 2£ T PB p ir < 0, 
for every £ and ir that satisfy 

7T? < (C q 4) T '(C g ,iO , i = l,...,n g . 

Using the <S-procedure, a sufficient condition is that there exists a diagonal A > 
such that 



A T P +PA+ CTC Z + CTAC a PB V 



B T P 



-A 



< 0. 



(6.45) 



Therefore we obtain the smallest upper bound on the output energy (6.34) provable 
via quadratic functions satisfying condition (6.35) and the <S-procedure by solving the 
following EVP in the variables P and A: 



minimize x(0) T Px(0) 

subject to P > 0, A > and diagonal, 



(6.45) 



With Q = P , condition (6.35) is also equivalent to the existence of a diagonal matrix 
M > satisfying 



AQ + QA T + B P MB? QC t z QC T q 
C Z Q -I 



c„ 







-M 



< 0. 



Once again, we leave it to the reader to extend the results in the remainder of the 
section to DNLDIs. 

Maximum output energy extractable from a set 

As an extension, we seek bounds on the maximum extractable energy from x(0), which 
is known only to lie in a polytope V = Co{«i, . . . , v p }. 

We first note that since P > 0, x(0) T Px(0) takes its maximum value on one of 
the vertices v\, . . . ,v p . Therefore, the smallest upper bound on the extractable energy 
from V provable via quadratic Lyapunov functions for any of our LDIs is computed 
by solving (6.37), (6.42) or (6.44) p times, successively setting cc(0) = Vt, i = 1, . . . ,p. 
The maximum of the p resulting optimal values is the smallest upper bound sought. 

As a variation, suppose that x(0) is only known to belong to the ellipsoid £ = 
{£ I £, T X(, < 1}. Then, the smallest upper bound on the maximum output energy 
extractable from £ provable via quadratic functions satisfying (6.35) is obtained by 
replacing the objective function x(0) T Px(0) in the EVPs (6.37), (6.42), and (6.44) by 
the objective function A max (X~ 1 ' 2 i , X -1 ' 2 ), and solving the corresponding EVPs. For 
LTI systems, the exact value of the maximum extractable output energy is obtained. 
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Finally, if xq is a random variable with ExoXq = Xq, then we obtain the small- 
est upper bound on the expected output energy provable via quadratic functions by 
replacing the objective function x(0) T Px(0) in the EVPs (6.37), (6.42), and (6.44) by 
the objective function Tr X P and solving the corresponding EVPs. 



6.2.2 Bounds on output peak 

It is possible to derive bounds on ||z(£)|| using invariant ellipsoids. 

Assume first that the initial condition a;(0) is known. Suppose £ = {£ | £ T P£ < 1} 
is an invariant ellipsoid containing x(0) for the LDI (6.32). Then 

z(t) T z(t) < max Z T C z (tfC z (tK 

for all t > 0. We can express maxj e £ t; T C z (t) T C z (t)£ as the square root of the 
minimum of 5 subject to 



P CJ 

c z SI 



> 0. (6.46) 



• LTI systems: The smallest bound on the output peak that can be obtained via 
invariant ellipsoids is the square root of the optimal value of the EVP in the variables 
P and 5 



minimize 5 

subject to P > 0, x{0) T Px{0) < 1, (6.47) 

(6.46), A T P+ PA<0 

• POLYTOPIC LDIs: For PLDIs, we obtain the smallest upper bound on ||z|| provable 
via invariant ellipsoids by taking the square root of the optimal value of the following 
EVP in the variables P and 5 

minimize 5 

subject to P > 0, x{0) T Px{0) < 1, (6.46), 

AfP + PA.KO, i = l,...,L 

• Norm-bound LDIs: For NLDIs, we obtain the smallest upper bound on ||;z|| 
provable via invariant ellipsoids by taking the square root of the optimal value of the 
following EVP in the variables P, A and 5 

minimize 5 

subject to P > 0, x(0) T Px(0) < 1, A > 0, (6.46), 



A T P +PA+ \C T q C q PB p 
B T V P -XI 



< 
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Remark: As variation of this problem, we can impose a decay rate constraint on 
the output, that is, given a. > 0, we can compute an upper bound on the smallest 
7 such that the output z satisfies 

\\z(t)\\<l^~ a \ 
for all t > 0. This also reduces to an EVP. 

6.3 Input-to-Output Properties 

We finally consider the input-output behavior of the LDI 

x = A{t)x + B w {t)w, x(0) = x , 

(6.48) 
z = C z {t)x + D zw (t)w, 



where 

A(i\ n_..(t\ 

e n. (6.49) 



A{t) B w {t) 
C z (t) D zw (t) 



6.3.1 Hankel norm bounds 

In this section, we assume that D zw (t) = for all t, and consider the quantity 



(f> = max < / z z dt 



J^w T wdt<l, x(0) = 0, 
w (t) = 0, t > T > 



and the maximum is taken over A(t), B(t) and C(t) satisfying (6.49). For an LTI 
system, \[§ equals the Hankel norm, a name that we extend for convenience to LDIs. 
An upper bound for <j) can be computed by combining the ellipsoidal bounds on 
reachable sets from §6.1.1 and the bounds on the output energy from §6.2.1. Suppose 
that P > and Q > satisfy 

— (x T Px) < w T w, — (x T Qx) < -z T z (6.50) 

for all x, w and z satisfying (6.48). Then, from the arguments in §6.1.1 and §6.2.1, we 
conclude that 

<p < sup { /~ z T z dt | x{T) T Px{T) < 1} 
< sup {x{T) T Qx{T) | x{T) T Px{T) < l} 

= \^(P- 1/2 QP- 1/2 )- 

• LTI SYSTEMS: We compute the smallest upper bound on the Hankel norm provable 
via quadratic functions satisfying (6.50) by computing the square root of the minimum 
7 such that there exist P > and Q > satisfying 

A T P + PA + PB w BlP < 0, 
A T Q + QA + CjC z < 0, 
7 P-Q>0. 
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This is a GEVP in the variables 7, P, and Q. It is possible to transform it into an EVP 
by introducing the new variable Q = Q/j. The corresponding EVP in the variables 
7, P, and Q is then to minimize 7 subject to 

A T P + PA + PB w BlP < 0, 

A T Q + QA + CfC z / 1 <Q, 
P-Q>0 

In this case, the optimal value is exactly the Hankel norm. It can be found analytically 
as A max (W c W ) where W c and W Q are the controllability and observability Gramians, 
i.e., the solutions of the Lyapunov equations (6.6) and (6.38), respectively. 

• Polytopic LDIs: We compute the smallest upper bound on the Hankel norm 
provable via quadratic functions satisfying (6.50) by computing the square root of the 
minimum 7 such that there exist P > and Q > satisfying 

AfP + PA, + PB w . t Bl l P < 0, 

A?Q + QAi + C% ti C x ,i<0, i = l,...,L 

7 P-Q>0. 

This is a GEVP in the variables 7, P, and Q. Defining Q = Q/"f, an equivalent EVP 
in the variables 7, P, and Q is to minimize 7 subject to 

AfP + PAi + PB w . l Bl l P < 0, 

Aj Q + QAi + Clfi^h < 0, i = l,...,L 

P-Q>0 



• Norm-bound LDIs: Similarly, the smallest upper bound on the Hankel norm 
provable via quadratic functions satisfying (6.50) is computed by taking the square 
root of the minimum 7 such that there exist P > 0, Q > 0, A > 0, and [i > satisfying 



A T P +PA+ XC^C q PB p PB W 
B*P -XI 

BlP -I 

A T Q + QA + CfC z + iiC T q C q QB P 



BlQ 



-Hi 



<o, 



<o, 



7P - Q > 0. 



This is a GEVP in the variables 7, P, Q, A, and \x. If we introduce the new variables 
Q = Qlli v = /V7 ana - equivalent EVP in the variables 7, P, Q, A, and v is to 
minimize 7 such that 



" A T P +PA+ XC^C q PB P PB W ' 




B^P -XI 


< 


BlP -I \ 




A T Q + QA + C T z C z h + vCTC q QB p 


B T V Q -1 


'I 



<o, 

tip y -vl 

P-Q>0. 
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6.3.2 L 2 and RMS gains 

We assume D zw (t) = for simplicity of exposition. We define the L2 gain of the 
LDI (6.48) as the quantity 

NI2 

SU P Ti — TT" 

IMh^o IM| 2 

where the L2 norm of u is \\u\\^ = L u T u dt, and the supremum is taken over 
all nonzero trajectories of the LDI, starting from x(0) = 0. The LDI is said to be 
nonexpansive if its L2 gain is less than one. 

Now, suppose there exists a quadratic function V(£) = £ P£, P > 0, and 7 > 
such that for all t, 

—V(x) + z z — 7 w w < for all x and w satisfying (6.48). (6.51) 

Then the L2 gain of the LDI is less than 7. To show this, we integrate (6.51) from 
to T, with the initial condition x(0) = 0, to get 

V(x{T)) + / (z T z - "f 2 w T w) dt < 0. 
Jo 

since V(x(T)) > 0, this implies 

1Mb 



<7- 



w 2 



Remark: It can be easily checked that if (6.51) holds for V(£) = £, T P£, P > 0, 
then 7 is also an upper bound on the RMS gain of the LDI, where the root- mean- 
square (RMS) value of £ is defined as 

/ , rT \ 1/2 

RMS(0= (limsup- / f^dt 

and the RMS gain is defined as 

RMS(z) 
sup r~r- 

RMS(«)/o RMS(w) 



Now reconsider condition (6.51) for LDIs. 
• LTI SYSTEMS: Condition (6.51) is equivalent to 



A T P +PA+ C T Z C Z PB W 
B T W P -1 2 I 



< 0. (6.52) 



Therefore, we compute the smallest upper bound on the L2 gain of the LTI system 
provable via quadratic functions by minimizing 7 over the variables P and 7 satisfying 
conditions P > and (6.52). This is an EVP. 

Remark: Assuming that (A, B w , C z ) is minimal, this EVP gives the exact value 
of the L2 gain of the LTI system, which also equals the H^ norm of its transfer 
matrix, \\C z (sI — A)~ 1 B W ||<x,. 
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Assume B w 7^ and C z ^ 0; then the existence of P > satisfying (6.52) is 
equivalent to the existence of Q > satisfying 



AQ + QA T + B w Blh 2 QC T Z 
C Z Q -I 



< 0. 



• Polytopic LDIs: Condition (6.51) is equivalent to 
\T d i & a . i r-iT r< . pb 



AjP + PAi + Ci^i 
B T P 



2] 



-Yi 



<o, 



(6.53) 



Assume there exists «o f° r which B wio ^ 0, and jo for which C z j 7^ 0. Then there 
exists P > satisfying (6.53) if and only if there exists Q > satisfying 



v2l 



(^z.iQ 



'z,i 

-1 2 I 



<o, 



1,...,L. 



We get the smallest upper bound on the L2 gain provable via quadratic functions by 
minimizing 7 (over 7 and P) subject to (6.53) and P > 0, which is an EVP. 

• Norm-bound LDIs: For NLDIs, condition (6.51) is equivalent to 

i T {A T P +PA+ C T z C z )i + 2£ T P {B P TT + B w w) - 1 2 w T w < 

for all £ and n satisfying 

n T 7T < ecJC q £. 

This is true if and only if there exists A > such that 



A T P + PA + C T Z C Z + XC^C q PB p PB W 



-7 2 / 



B T P 
BlP 



-XI 





< 0. 



(6.54) 



Therefore, we obtain the best upper bound on the L2 gain provable via quadratic 
functions by minimizing 7 over the variables 7, P and A, subject to P > 0, A > 
and (6.54). 

If B w 7^ and C z 7^ 0, then the existence of P > and A > satisfying (6.54) is 
equivalent to the existence of Q > and fi > satisfying 



QA T + AQ + B w Bl + nB p B^ QC T q QC 7 Z 



C q Q 
C Z Q 



-Hi 






-7 2 / 



< 0. 



• Diagonal Norm-bound LDIs: For DNLDIs, condition (6.51) is equivalent to 
i T (A T P +PA+ C T z C z )i + 2£ T P {B p tt + B w w) - 1 2 w T w < 



for all £ and 7r satisfying 



1,.. 



! "■<?• 
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Using the <S-procedure, a sufficient condition is that there exist a diagonal A > such 
that 



A T P + PA + C T Z C Z + CjAC q PB P PB n 



B T P 

P 

B T P 



-A 







-1 2 I 



< 0. 



(6.55) 



We obtain the best such upper bound by minimizing 7 over the variables 7, P and A, 
subject to P > 0, A > and (6.55). 

If B w 7^ and C z 7^ 0, then the existence of P > and A > satisfying (6.54) is 
equivalent to the existence of Q > and M = diag(/ii, . . . , \i n ) > satisfying 



QA T + AQ + B w Bl + B p MBT QC T q QCJ 
C q Q -M 

C Z Q - 7 2 / 



< 0. 



6.3.3 Dissipativity 

The LDI (6.48) is said to be passive if every solution x with x(0) = satisfies 

w T z dt>0 
for all T > 0. It is said to have dissipation r/ if 

r-T 

(wz — rjw w) dt > 



(6.56) 



holds for all trajectories with x(0) = and all T > 0. Thus passivity corresponds to 
nonnegative dissipation. The largest dissipation of the system, i.e., the largest number 
r\ such that (6.56) holds, will be called its dissipativity . 

Suppose that there is a quadratic function V(£) = £, T P£, P > 0, such that 



for all x and w satisfying (6.48), — V( x ) ~ 2w z + 2r\w w < 0. 

dt 

Then, integrating (6.57) from to T with initial condition x(0) = yields 

i-T 

V(x{T)) - / (2w T z - 2rjw T w) dt < 0. 

Since V(x(T)) > 0, we conclude 



(6.57) 



(w z — r/w w) dt > 0, 



which implies that the dissipativity of the LDI is at least r\. Now reconsider condi- 
tion (6.57) for the various LDIs. 



• LTI systems: For LTI systems, condition (6.57) is equivalent to 

A T P + PA PB W - C T Z 

B T W P-C Z 2r)I-{D T zw + D zu 



< 0. 



(6.58) 



We find the largest dissipation that can be guaranteed using quadratic functions by 
maximizing rj over the variables P and r), subject to P > and (6.58), an EVP. 
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Assuming that the system (A, B w , C z ) is minimal, the optimal value of this EVP 
is exactly equal to the dissipativity of the system, which can be expressed in terms of 
the transfer matrix H(s) = C z (sl — A)~ 1 B W + D zw as 

Xmin(H( S ) + H(s)*) 



inf 

Res>0 2 

This follows from the PR lemma. 

Defining Q = P^ 1 , condition (6.58) is equivalent to 



QA T + AQ B w - QC T Z 

Bl-C Z Q 2r,I-(D T zw + D z 



< 0. 



(6.59) 



(6.60) 



• Polytopic LDIs: For PLDIs, condition (6.57) is equivalent to 
AfP + PA, PB Wtl - C T 



3 T 

W . % 



Bl;P-C 7 . 



Z,l 

D 7 



<o, 



1, 



L, 



(6.61) 



We find the largest dissipation that can be guaranteed via quadratic functions by 
maximizing r\ over the variables P and r\ satisfying P > and (6.61). This is an EVP; 
its optimal value is a lower bound on the dissipativity of the PLDI. 
If we define Q = P , condition (6.57) is equivalent to 



QAJ + A t Q 



Bn 



QCl 



B, 



C Z <Q 2 V I-(D T wA + D z 



<0, 



1. 



,L. 



(6.62) 



• Norm-bound LDIs: For NLDIs, using the 5-procedure, the condition (6.57) is 
true if and only if there exists a nonnegative scalar A such that 

CJ PB p 

B T W P-C Z 2 V I-(D T W +D ZW ) 

-XI 



A T P +PA + \C T C q 



PB„ 



B T P 



< 0. 



(6.63) 



Therefore we can find the largest dissipation provable with a quadratic Lyapunov 
function by maximizing r/ (over r\ and P), subject to P > and (6.63). This is an 
EVP. Defining Q = P , condition (6.57) is equivalent to the existence of \x > 
satisfying 



QA T + AQ + fiB p B T B w - 

Bl-C Z Q 2 V I-(D 

C„Q 



-QCJ 


QC T q ~ 


ZW T L>ZW) 








-fll _ 



< 0. 



(6.64) 



6.3.4 Diagonal scaling for componentwise results 

We assume for simplicity that D zw (t) = 0. Assuming that the system (6. 
many inputs as outputs, we consider the system 



x = A(t)x + B w {t)T- 1 w, 
z = TC z {t)x. 



x(0) = 0, 



has as 



(6.65) 



where T is a positive-definite diagonal matrix, which has the interpretation of a scal- 
ing. We will see that scaling enables us to conclude "componentwise" results for the 
system (6.65). 
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Consider for example, the "scaled L2 gain" 

a = mi sup . 

T diagonal, IHU#0 IMI2 

T> 
The scaled L2 gain has the following interpretation: 



max sup 

»=i,...,n« ii^n^o lki| 



< a. 



We show this as follows: For every fixed scaling T = diag(ii, . . . , t nz ), 



SU P||™|| 2 ^0 



ETA 

su P||m)|| 2 #0 y^™^ +2 



i 1 'ill^ll2_ 
2 

J 112 



> 



SU P|l™i||2^0 ■ 



Er*ik 
2 
2 



iwi? 



12- 



Since the inequality is true for all diagonal nonsingular T, the desired inequality 
follows. We now show how we can obtain bounds on the scaled L2 gain using LMIs 
for LTI systems and LDIs. 

• LTI SYSTEMS: The L2 gain for the LTI system scaled by T is guaranteed to be 
less than 7 if there exists P > such that the LMI 



A T P +PA+ C T Z SC Z PB W 
BlP - 7 2 S 



<0 



holds with S = T T T. The smallest scaled L2 gain of the system can therefore be 
computed as a GEVP. 

• POLYTOPIC LDIs: For PLDIs, the scaled L 2 gain is guaranteed to be lower than 
7 if there exists S > 0, with S diagonal, and P > which satisfy 



A^ r + PAi + G z jiC Z: i 


PB 


B T P 


-i 2 s \ 



<o, 



(6.66) 



The optimal scaled L 2 gain 7 is therefore obtained by minimizing 7 over (7, P and 
S) subject to (6.66), P > 0, S > and S diagonal. This is a GEVP. 

• Norm-bound LDIs: For NLDIs, the scaled L 2 gain (with scaling T) is less than 7 
if there exist A > 0, P > 0, and S > 0, S diagonal, such that the following LMI holds: 



A T P + PA + C T Z SC Z + XCjC q PB P PB V 



B T P 

p 

B T P 



-XI 







-7 2 S 



< 0. 



(6.67) 



Therefore, minimizing 7 2 subject to the constraint (6.67) is an EVP. 
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Notes and References 

Integral quadratic constraints 

In §8.2 (and §10.9) we consider a generalization of the NLDI in which the pointwise condition 
P P 5i Q Q i s replaced by the integral constraint 

p p dt < I q qdt. 

Jo 

Many of the results derived for NLDIs and DNLDIs then become necessary and sufficient. 
Integral quadratic constraints were introduced by Yakubovich. For a general study of these, 
we refer the reader to the articles of Yakubovich (see [Yak92] and references therein), and 
also Megretsky [MEG93, MEG92A, Meg92b]. 

Reachable sets for componentwise unit-energy inputs 

We study in greater detail the set of states reachable with componentwise unit-energy inputs 
for LTI systems (see §6.1.2). We begin by observing that a point xo belongs to the reachable 
set if and only if the optimal value of the problem 

minimize max / w 4 dt 



subject to x — —Ax + B w w, x(0) = xo, lim x(t) — 

t — >oo 

is less than one. This is a multi-criterion convex quadratic problem, considered in §10.8. In 
this case, the problem reduces to checking whether the LMI 



xqPx < 1, 



A T P + PA PB 



<0, 



B T P -R 

P > 0, R > 0, diagonal, Tr R = 1 

is feasible. This shows that the reachable set is the intersection of ellipsoids {£ | £ T P£ < l} 
satisfying equation (6.14). 

A variation of this problem is considered in [SZ92], where the authors consider inputs w 
satisfying J ww dt < Q, or equivalently 

w T Ww dt < Tr WQ for every W > 0. 

They get necessary conditions characterizing the corresponding reachable set. We now derive 
LMI conditions that are both necessary and sufficient. 

As before, xo belongs to the reachable set if and only the optimal value of the problem 



minimize 

'o 

subject to x — —Ax + B w w, x(0) = xo, lim xit) — 

t — >oo 

is less than Tr WQ for every W > 0. This is equivalent to infeasibility of the LMI (in the 
variables P and W): 



XoPx -TrWQ>0, P>0 



A T P + PA PB 
B T P -W 



< 0. 
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These are precisely the conditions obtained in [SZ92]. 



Reachable sets with unit-peak inputs 

Schweppe [Sch73, §4.3.3] considers the more general problem of time-varying ellipsoidal ap- 
proximations of reachable sets with unit-peak inputs for time- varying systems; the positive- 
definite matrix describing the ellipsoidal approximation satisfies a matrix differential equa- 
tion, terms of which closely resemble those in the matrix inequalities described in this book. 
Sabin and Summers [SS90] study the approximation of the reachable set via quadratic func- 
tions. A survey of the available techniques can be found in the article by Gayek [Gay91]. 
See also [FG88]. 

The technique used to derive LMI (6.23) can be interpreted as an exponential time-weighting 
procedure. Fix a > 0. Then for every T > 0, rewrite LDI (6.1), with new exponentially 
time-weighted variables xx(t) = e a ^~ T '' 2 x(i) and v(t) — e a ^ t ~ T '' 2 w(t) as 



± T = [A(t) + -I)x T +B w (t)v, x T (0)=0. 



Since w(t) T w(t) < 1, we have J v T v dr < 1/ct. 

Now suppose that P > satisfies condition (6.23), which we rewrite for convenience as 



(A + aI/2) T P + P(A + aI/2) PB W 
BlP -al 



< 0. 



(6.68) 



The results of §6.1.1 imply that x T (T) satisfies x T (T) T Px T (T) < 1; therefore, x{T) T Px(T) < 



1. Since LMI (6.68) is independent of T, the ellipsoid |c 
set with unit-peak inputs. 



Px < 1 } contains the reachable 



Bounds on overshoot 

The results of §6.1.3 can be used to find a bound on the step response peak for LDIs. Consider 
a single-input single-output LDI, subject to a unit-step input: 



x = A(t)x + b w (t), x(0) 
s — c z (t)x, 



A(t) 
c(t) 



b w (t) 





e n. 



Since a unit-step is also a unit-peak input, an upper bound on max t >o \s(t)\ (i.e., the over- 
shoot) can be found by combining the results on reachable sets with unit-peak inputs (§6.1.3) 
and bounds on output peak (§6.2.2). 

Suppose there exist Q > and a > satisfying (6.25), 

A{t)Q + QA{t) T + aQ+ -b w (t)b w (t) T < 0. 
a 

Then, from §6.2.2, we conclude that max t >o \s(t)\ does not exceed M, where M 2 = max t > 
Cz{t) T Qc z (t). This can be used to find LMI-based bounds on overshoot for LTI systems, 
PLDIs and NLDIs. We note that the resulting bounds can be quite conservative. These 
LMIs can be used to determine state-feedback matrices as well (see the Notes and References 
of Chapter 7). 
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Bounds on impulse response peak for LTI systems 

The impulse response h(t) — ce l b of the single-input single-output LTI system 

x — Ax + bu, x(0) = 0, y — ex 

is just the output y with initial condition x(0) = b and zero input. Therefore, we can com- 
pute a bound on the impulse response peak by combining the results on invariant ellipsoids 
from §5.2 with those on the peak value of the output from §6.2.2. The bound obtained this 
way can be quite conservative. For instance, consider the LTI system with transfer function 



n 
S + Sl J-J- S + Si' 



where Si+i 2> Si > 0, i.e., the dynamics of this system are widely spaced. The bound on 
the peak value of the impulse response of the system computed via the EVP (6.47) turns 
out to be In — 1 times the actual maximum value of the impulse response, where n is the 
dimension of a minimal realization of the system [Fer93]. We conjecture that the bound 
can be no more conservative, that is, for every LTI system, the bound on the peak of the 
impulse response computed via the EVP (6.47) can be no more than 2n — 1 times the actual 
maximum value of the impulse response. 

Examples for which the bound obtained via the EVP (6.47) is sharp include the case of passive 
LTI systems. In this case, we know there exists a positive-definite P such that A T P + PA < 0, 
Pb — c , so that the maximum value of the impulse response is cb and is attained for t — 0. 
This is an illustration of the fact that passive systems have nice "peaking" properties, which 
is used in nonlinear control [Kok92], 
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Chapter 7 

State-Feedback Synthesis for LDIs 



7.1 Static State-Feedback Controllers 



We consider the LDI 

x = A(t)x + B w {t)w + B u {t)u, 



z = C z {t)x + D zw {t)w + D zu (t)u, 



A(t) B w (t) B u (t) 

C z {t) D zw {t) D zu {t) 



(7.1) 



en, 



where O has one of our special forms (i.e., singleton, polytope, image of a unit ball 
under a matrix linear- fractional mapping). Here u : R + —> R™" is the control input 
and w : R + — » R" m is the exogenous input signal. 

Let K G R n " n , and suppose that u = Kx. Since the control input is a linear 
function of the state, this is called (linear, constant) state- feedback , and the matrix K 
is called the state-feedback gain. This yields the closed-loop LDI 



x = (A(t) + B u {t)K) x + B w (t)w, 
z = (C z (t) + D zu {t)K) x + D zw {t)w. 



(7.2) 



In this chapter we consider the state- feedback synthesis problem, i.e., the problem of 
finding a matrix K so that the closed-loop LDI (7.2) satisfies certain properties or 
specifications, e.g., stability. 

Remark: Using the idea of global linearization described in §4.3, the methods of 
this chapter can be used to synthesize a linear state-feedback for some nonlinear, 
time-varying systems. As an example, we can synthesize a state-feedback for the 
system 



x - f(x,w,u,t), 
provided we have 



z - g(x,w,u,t), 



d£ d£ d£ 

dx dw du 

dg dg dg 

dx dw du 



(x, w,u, t) £ O, 
and /(0, 0, 0, t) = 0, g(0, 0, 0, t) = 0, for all x, t, w and u. 
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7.2 State Properties 

We consider the LDI 

x = A(t)x + B u (t)u, [A(t) B u (t)\ <= fl. (7.3) 

• LTI systems: For LTI systems, (7.3) becomes 

x = Ax + B u u. (7.4) 

• POLYTOPIC LDIS: PLDIs are given by 

x = A(t)x + B u {t)u, [A{t) B u {t)}e Co {[Ai B uA ],....[A L B U , L ]} . (7.5) 

• Norm-bound LDIs: For NLDIs, (7.3) becomes 

x = Ax + B u u + B p p, q = C q x + D qu u + D qp p 

p=A(t)q, ||A(t)||<l. 

Equivalently, we have 

X = Ax + B u u + B p p, q = C q x + D qp p + D qu u, p T p < q T q. 

• Diagonal Norm-bound LDIs: For DNLDIs, equation (7.3) becomes 

x = Ax + B u u + B p p, q = C q x + D qp p + D qu u, 

Pi = Si(i)qi, \Si(t)\<l, i = l,...,L. 

Equivalently, we have 

x = Ax + B u u + B p p, q = C q x + D qp p + D qu u, 

\Pi\ < \Qi\, i = l,...,n q . 

7.2.1 Quadratic stabilizability 

The system (7.1) is said to be quadratically stabilizable (via linear state-feedback) if 
there exists a state-feedback gain K such that the closed-loop system (7.2) is quadrat- 
ically stable (hence, stable). Quadratic stabilizability can be expressed as an LMIP. 

• LTI systems: Let us fix the matrix K. The LTI system (7.4) is (quadratically) 
stable if and only if there exists P > such that 

{A + B U K) T P + P{A + B U K) < 0, 

or equivalently, there exists Q > such that 

Q(A+B u K) T + (A + B u K)Q<0. (7.8) 

Neither of these conditions is jointly convex in K and P or Q, but by a simple change 
of variables we can obtain an equivalent condition that is an LMI. 

Define Y = KQ, so that for Q > we have K = YQ^ 1 . Substituting into (7.8) 
yields 

AQ + QA T + B U Y + Y T Bl < 0, (7.9) 
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which is an LMI in Q and Y. Thus, the system (7.4) is quadratically stabilizable if 
and only if there exist Q > and Y such that the LMI (7.9) holds. If this LMI is 
feasible, then the quadratic function V(£) = (, T Q~ X t; proves (quadratic) stability of 
system (7.4) with state-feedback u = YQ~ 1 x. 

Remark: We will use the simple change of variables Y = KQ many times in 
the sequel. It allows us to recast the problem of finding a state-feedback gain 
as an LMIP with variables that include Q and Y; the state-feedback gain K is 
recovered as K = YQ~ l . 

An alternate equivalent condition for (quadratic) stabilizability, involving fewer 
variables, can be derived using the elimination of matrix variables described in §2.6.2: 
There exist Q > and a scalar a such that 

AQ + QA T - crB u BZ < 0. (7.10) 

Since we can always assume a > in LMI (7.10), and since the LMI is homogeneous 
in Q and a, we can without loss of generality take a = 1, thus reducing the number of 
variables by one. If Q > satisfies the LMI (7.10), a stabilizing state- feedback gain is 
given by K = -(a/2)B T Q~\ 

From the elimination procedure of §2.6.2, another equivalent condition is 

B T U (AQ + QA T ) B u < 0, (7.11) 

where B u is an orthogonal complement of B u . For any Q > satisfying (7.11), a 
stabilizing state-feedback gain is K = — (a/2)B T Q^ 1 , where a is any scalar such 
that (7.10) holds (condition (7.11) implies that such a scalar exists). 

For LTI systems, these LMI conditions are necessary and sufficient for stabiliz- 
ability. In terms of linear system theory, stabilizability is equivalent to the condition 
that every unstable mode be controllable; it is not hard to show that this is equivalent 
to feasibility of the LMIs (7.9), (7.11), or (7.10). 

• POLYTOPIC LDIS: For the PLDI (7.5), the same argument applies. Quadratic 
stabilizability is equivalent to the existence of Q > and Y with 

QAf + AiQ + B Uii Y + Y T Bl ii < 0, i = l,...,L. (7.12) 

• Norm-bound LDIs: With u = Kx, the NLDI (7.6) is quadratically stable (see 
LMI (5.14)) if there exist Q > and fi > such that 



AQ + QA 1 +B U KQ , /R pT /w , n ; ; 
+ QK J 'Bl + fiBpBl 



'Tr>T , ,,n oT I t~ p qp 



[iB p D T + Q(C q + D qu K? 



HD„BI + (C q + D qu K)Q -ii(I - D„DQ 



<0. 



This condition has a simple frequency-domain interpretation: The H^ norm of the 
transfer matrix from p to q for the LTI system x = (A + B u K)x + B p p, q = (C q + 
D qu K)x + D qp p is less than one. 

With Y = KQ, we conclude that the NLDI is quadratically stabilizable if there 
exist Q > 0, fi > and Y such that 



AQ QA + fur Bp »B p Dl + QCT + YTDT u 



V +B U Y + Y+Bl 
[iB qp B T p + C q Q + D qu Y - fi{I - D qp D T qp ) 
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Using the elimination procedure of §2.6.2, we obtain the equivalent conditions: 



/ AQ + QA T \ 
\ - oB u B T u + iiB p B T p ) 


QC q + fiB p D qp - 


- aB u D qu 


C q Q + fxDq P B p — aD qu B u 


-H(I - D qp D T qp ) - 


- cjD qu B T qu 


-/i(J - D qp D T qp ) < 0. 







<o, 



(7.14) 



The latter condition is always satisfied since the NLDI is well-posed. By homogeneity, 
we can set \x = 1. If Q > and a > satisfy (7.14) with \x = 1, a stabilizing 
state-feedback gain is K = (— a /2)B^Q~ 1 . 

An alternate, equivalent condition for quadratic stabilizability is expressed in 
terms of the orthogonal complement of [B^ DT U ] T , which we denote by G: 



G 1 



' AQ + QA T + fj.B p B% 


QCT + uBpDl ' 


C q Q + vD qp B T p 


-H{I - D qp D T qp ) 



G <0. 



(7.15) 



(Again, we can freely set /z = 1 in this LMI.) 

In the remainder of this chapter we will assume that D qp in (7.6) is zero in order 
to simplify the discussion; all the following results can be extended to the case in 
which D qp is nonzero. 

• Diagonal Norm-bound LDIs: With u = Kx, the DNLDI (7.7) is quadratically 
stable (see LMI (5.16)) if there exist Q > 0, M = diag(/ii, . . . , \x n ) > satisfying 



AQ + QA T 4 
±QK T Bl + 

(C q 



DgpMBT 



B U KQ \ 
B v MB T p ) 
+ D qu K)Q 



BpMDj p -\ 
-(M 



Q(c q + D qu xy j 



D qp MD T qp ) 



<0. 



This condition leads to a simple frequency-domain interpretation for quadratic stabi- 
lizability of DNLDIs: Let H denote the transfer matrix from p to q for the LTI system 
x = (A + B u K)x + Bpp, q= {C q + D qu K)x + D qp p. Then the DNLDI is quadratically 
stabilizable if there exists K such that for some diagonal positive-definite matrix M, 
\\M~ 1 ' 2 HM 1 ' 2 \\ 00 < 1. There is no analytical method for checking this condition. 

With Y = KQ, we conclude that the NLDI is quadratically stabilizable if there 
exist Q > 0, M > and diagonal, and Y such that 



B p MB T p 



I AQ + QA T 
V +B u Y + Y T Bl J 
D qp MB T p + C q Q + D qu Y 



BpMDl 



(M 



QC? + Y^D? 



D q pMDTp) 



<0. 



As before, we can eliminate the variable Y using the elimination procedure of §2.6.2, 
and obtain equivalent LMIs in fewer variables. 

7.2.2 Holdable ellipsoids 

Just as quadratic stability can also be interpreted in terms of invariant ellipsoids, we 
can interpret quadratic stabilizability in terms of holdable ellipsoids. We say that the 
ellipsoid 

£={£eR" |e T Q _1 e<l} 
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is holdable for the system (7.3) if there exists a state-feedback gain K such that £ 
is invariant for the system (7.3) with u = Kx. Therefore the LMIs described in the 
previous section also characterize holdable ellipsoids for the system (7.3). 

With this parametrization of quadratic stabilizability and holdable ellipsoids as 
LMIs in Q > and Y, we can solve various optimization problems, such as finding 
the coordinate transformation that minimizes the condition number of Q, imposing 
norm constraints on the input u = Kx, etc. 

Remark: In Chapter 5, we saw that quadratic stability and invariant ellipsoids 
were characterized by LMIs in a variable P > and also its inverse Q — P _1 . 
In contrast, quadratic stabilizability and ellipsoid holdability can be expressed 
as LMIs only in variables Q and Y; it is not possible to rewrite these LMIs as 
LMIs with Q~ l as a variable. This restricts the extension of some of the results 
from Chapter 5 (and Chapter 6) to state-feedback synthesis. As a rule of thumb, 
results in the analysis of LDIs that are expressed as LMIs in the variable Q can 
be extended to state-feedback synthesis, with a few exceptions; results expressed 
as LMIs in P are not. For example, we will not be able to compute the minimum 
volume holdable ellipsoid that contains a given polytope V (see problem (5.33)) 
as an optimization problem over LMIs; however, we will be able to compute the 
minimum diameter holdable ellipsoid (see problem (5.34)) containing V. 



7.2.3 Constraints on the control input 

When the initial condition is known, we can find an upper bound on the norm of 
the control input u(t) = Kx(t) as follows. Pick Q > and Y which satisfy the 
quadratic stabilizability condition (either (7.9), (7.12) or (7.13)), and in addition 
x(0) T Q~ 1 x(0) < 1. This implies that x(t) belongs to £ for all t > 0, and conse- 
quently, 

max||w(i)|| = max t > ||yQ- 1 x(t)|| 

< max ie j ||yQ _1 x|| 

= \ m UQ~ 1/2 Y T YQ-^). 

Therefore, the constraint ||w(t)|| < \i is enforced at all times t > if the LMIs 



1 a;(0) T 
x(0) Q 



>0, 



Y T 

2/ 



Y n z I 



> 



(7.16) 



hold, where Q > and Y satisfy the stabilizability conditions (either (7.9), (7.12) 
or (7.13)). 

We can extend this to the case where x(0) lies in an ellipsoid or polytope. For 
example suppose we require (7.16) to hold for all ||x(0|| < 1. This is easily shown to 
be equivalent to 



> I, 



Q Y T 
Y fi 2 I 



> 0. 



As another extension we can handle constraints on 

II "(Oil max = max|wj(i)| 
This electronic version is for personal use and may not be duplicated or distributed. 



104 



Chapter 7 State-Feedback Synthesis for LDIs 



in a similar manner: 

max||u(i)|| max = max t > ||y<2 _1 x(i)|| max 

< niax ie£ ||yQ _1 x|| max 
= max, (FQ- 1 ^ . 

Therefore, the constraint ||u(i)|| max < fi for t > is implied by the LMI 



1 x{0) T 
x(0) Q 



>0, 



X Y 

Y T Q 



> 0, X u < /i 2 , 



where once again, Q > and V satisfy the stabifizability conditions (either (7.9), 
(7.12) or (7.13)). 



7.3 Input-to-State Properties 

We next consider the LDI 

x = A(t)x + B w (t)w + B u (t)u. 



(7.17) 



• LTI systems: For LTI systems, equation (7.17) becomes x = Ax + B w w + B u u. 

• POLYTOPIC LDIS: PLDIs are given by x = A(t)x + B w (t)w + B u (t)u, where 
[A(t) B w (t) B u (t)]G Co {[i4i B Wtl B Utl ],...,[A L B W . L B U>L ]}. 

• Norm-bound LDIs: For NLDIs, equation (7.17) becomes 

X = Ax + B u u + B p p + B w w, q = C q x + D qu u 

p=A(t)q, ||A(t)||<l 

which we can also express as 

x = Ax + B u u+ B p p+ B w w, q = C q x + D qu u, p T p<q T q- 

• Diagonal Norm-bound LDIs: For DNLDIs, equation (7.17) becomes 

x = Ax + B u u + B p p + B w w, q = C q x + D qu u 

Pi = Si(i)qi, \Si(t)\<l, i = l,...,L. 

Equivalently, we have 

X = Ax + B u u + B p p + B w w, q = C q x + D qu u, 

\Pi\ < \Qi\, i = l,...,n q . 



7.3.1 Reachable sets with unit-energy inputs 

For the system (7.17) with u = Kx, the set of states reachable with unit energy is 
defined as 



ft„o = < 



x(T) 



x, w, u satisfy (7.17), u = Kx, x(0) = 0, 

r-T 

w T w dt<l, T > 



> . 
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We will now derive conditions under which there exists a state-feedback gain K guar- 
anteeing that a given ellipsoid £ = {£ € R" | £ T <3 _1 £ < 1} contains 7^ ue . 

• LTI SYSTEMS: From §6.1.1, the ellipsoid £ contains TZ UC for the system (7.4) for 
some state-feedback gain K if K satisfies 

AQ + QA T + B U KQ + QK T B T U + B W B T W < 0. 

Setting KQ = Y , we conclude that £ D TZ ue for some K if there exist Y such that 

QA T + AQ + B U Y + Y T Bl + B W B T W < 0. 

For any Q > and Y satisfying this LMI, the state- feedback gain K = YQ^ 1 is such 
that the ellipsoid £ contains the reachable set for the closed- loop system. 

Using the elimination procedure of §2.6.2, we eliminate the variable Y to obtain 
the LMI in Q > and the variable a: 



QA T + AQ- oB u B T u + B w Bl < 0. 



(7.18) 



For any Q > and a satisfying this LMI, a corresponding state-feedback gain is given 
by K = —(a/2)B^Q^ 1 . Another equivalent condition for £ to contain lZ ue is 



Bl (AQ + QA T + B w Bl) B u < 0, 
where B u is an orthogonal complement of B u . 



(7.19) 



• Polytopic LDIS: For PLDIs, £ contains 7?. U e for some state-feedback gain K if 
there exist Q > and Y such that the following LMI holds (see LMI (6.9)): 

QAf + A t Q + B Uti Y + Y T B^ + B w ^ i < 0, * = 1, . . . , L. 



• Norm-bound LDIs: For NLDIs, £ contains lZ ue for some state-feedback gain K 
if there exist /i > and Y such that the following LMI holds (see LMI (6.11)): 



QA T + AQ + B U Y + Y T Bl + B w Bl + fiB p B^ {C q Q + D qu Y) T 
C q Q + D qu Y -ill 



< 0. 



We can eliminate Y to obtain the LMI in Q > and a that guarantees that £ contains 
the reachable set lZ uo for some state-feedback gain K: 



AQ + QA T - oB u Bl 
+[iBpB p + B W B W 



QC q + fiB p D qp - aB u D qu 



C q Q + fiD qP B£ - oD qu Bi -n(I - D qp D J qp ) - aD qu D\ 



<0. 



(7.20) 



The corresponding state- feedback gain is K = (— (t/2)B^Q 1 . 

An equivalent condition is expressed in terms of G, the orthogonal complement 
0i[BlD T qu \ T : 



G 1 



QCJ + nB p D 



G <0. 



/ AQ + QA T 
{ +iiB p B T v + B w Bl 

C q Q + \iB qp Bl -fi{I - D qp D' J qpJ 
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• Diagonal Norm-bound LDIs: For DNLDIs, £ contains H uc for some state- 
feedback gain K if there exist Q > 0, M > and diagonal, and Y such that the 
following LMI holds (see LMI (6.12)): 



QA T + AQ + B U Y + Y T Bl + B W B? + B p MB T v (C q Q + D qu Y) T 
C q Q + D qu Y -M 



< 0. 



Using the elimination procedure of §2.6.2, we can eliminate the variable Y to obtain 
an equivalent LMI in fewer variables. 

Using these LMIs (see §6.1.1 for details): 

• We can find a state-feedback gain K such that a given point Xq lies outside the 
set of reachable states for the system (7.3). 

• We can find a state- feedback gain K such that the set of reachable states for the 
system (7.3) lies in a given half-space. This result can be extended to check if 
the reachable set is contained in a polytope. In this case, in contrast with the 
results in Chapter 6, we must use the same outer ellipsoidal approximation to 
check different faces. (This is due to the coupling induced by the new variable 
Y = KQ.) 



7.3.2 Reachable sets with componentwise unit-energy inputs 

With u = Kx, the set of reachable states for inputs with componentwise unit-energy 
for the system (7.17) is defined as 



n„ 



A 



x(T) 



x, w, u satisfy (7.17), u = Kx, x(0) = 0, 



w i Wi dt < 1 , 



T>0 



We now consider the existence of the state-feedback gain K such that the ellipsoid 
£ = {£ G R n t, T Q l ti < 1 } contains 7^ ucc for LTI systems and LDIs. 

• LTI SYSTEMS: From LMI (6.15), £ D TZ UCC for the LTI system (7.4) if there exists 
diagonal R > with unit trace such that the LMI 

QA T + AQ + B U Y + Y T B T U B w 

Bl -R ~ ' 



• Polytopic LDIs: For PLDIs, £ D 7?. ucc for some state-feedback gain K if there 
exist Q > 0, Y(= KQ), and a diagonal R > with unit trace (see LMI (6.17)), such 
that 



QAj + A t Q + B Uyi Y + Y 1 B^ B Wii 
B„„ , —R 



< 0, i = l,. 



L. 



• Norm-bound LDIs: For NLDIs, £ D 7?. uce for some state-feedback gain K if there 
exist Q > 0, (i > 0, Y and a diagonal R > with unit trace (see LMI (6.19)), such 
that 



QA T + Y T Bl + AQ + B U Y + ^iB p B^ QC^ + Y T D^ U B v 



C, 










-R 



< 0. 
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Remark: For LTI systems and NLDIs, it is possible to eliminate the variable Y . 



7.3.3 Reachable sets with unit-peak inputs 

For a fixed state-feedback gain K, the set of states reachable with inputs with unit- 
peak is defined as 



TZ up = { x{T) 



x, w, u satisfy (7.17), u = Kx, x(0) = 0, 
w{t) T w{t) < 1, T>0 



• LTI SYSTEMS: From condition (6.25), £ contains 7?. ucc for the LTI system (7.4) if 
there exists a > such that 

AQ + QA T + B U Y + Y T Bl + B w B^/a + aQ<0, 

where Y = KQ. 

• Polytopic LDIS: For PLDIs, £ D H ucc if there exist Q > and Y (see condi- 
tion (6.28)) such that 



AiQ + QAf + B Uyi Y + Y 1 B^ + aQ + B w ^B 1 wA /a < 



for i = 1, . . . , L. 

• Norm-bound LDIs: From Chapter 6, £ contains 7?. uco if there exist Q > 0, Y, 
a > and fi > such that 



/ AQ + QA T + aQ 

\ +B U Y + Y T B U + fiBpB? + B w Bl/a 
C q Q + D qu Y 

This condition is an LMI for fixed a. 



(C q Q + D qu Y) T 
-Hi 



< 0. 



Remark: Again, for LTI systems and NLDIs, it is possible to eliminate the 
variable Y. 



7.4 State-to-Output Properties 

We consider state-feedback design to achieve certain desirable state-to-output prop- 
erties for the LDI (4.5). We set the exogenous input w to zero in (4.5), and consider 

x = A(t)x + B u (t)u, z = C z (t)x. 



• LTI systems: For LTI systems, the state equations are 
x = Ax + B u u, z = C z x + D zu u. 



(7.21) 



• Polytopic LDIs: For PLDIs we have 

G Co 



A(t) B u (t) 
C z (t) D zu (t) 



A\ B u y 



B 



u , L 



C Z ,L -Dzm,L 
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• Norm-bound LDIs: NLDIs are given by 

x = Ax + B u u + B p p, z = C z x + D zu u 

q=C q x + D qu u p=A{t)q, ||A(£)||<1, 

which can be rewritten as 

x = Ax + B u u + B p p, z = C z x + D zu u 

q = C q x + D qu u, p T p < q T q. 



• Diagonal Norm-bound LDIs: Finally, DNLDIs are given by 

x = Ax + B u u + B p p, z = C z x + D zu u, q = C q x + D qu u, 

Pi = Si(i)qi, \Si(t)\<l, i = l,...,L. 
which can be rewritten as 

x = Ax + B u u + B p p, z = C z x + D zu u, q = C q x + D qu u, 



\Pi\ < h\, i = !,■ 



, n„ 



7.4.1 Bounds on output energy 

We first show how to find a state-feedback gain K such that the output energy (as 
defined in §6.2.1) of the closed-loop system is less than some specified value. We 
assume first that the initial condition xq is given. 

• LTI SYSTEMS: We conclude from LMI (6.39) that for a given state-feedback gain 
K, the output energy of system (7.21) does not exceed XqQ~ x xo, where Q > and 
Y = KQ satisfy 

AQ + QA T + B U Y + Y T B T U (C Z Q + D ZU Y) T 
C Z Q + D ZU Y -I 

Regarding Fasa variable, we can then find a state-feedback gain that guarantees an 
output energy less than 7 by solving the LMIP x^Q~ 1 Xq < 7 and (7.22). 

Of course, inequality (7.22) is closely related to the classical Linear- Quadratic 
Regulator (LQR) problem; see the Notes and References. 

• Polytopic LDIs: In this case, the output energy is bounded above by XqQ~ 1 x , 
where Q satisfies the LMI 

AiQ + QAJ 

rT U T 



<o, 



(7.22) 



+B U ^Y + Y 1 Bi ti 



(C Xti Q + D mti Y)' 

-I 



<0, 



1,.. 



for some Y. 



• Norm-bound LDIs: In the case of NLDIs, the output energy is bounded above 
by XqQ~ x Xo, for any Q > 0, Y and p> such that 

AQ + QA T + B U Y\ {CzQ + DzuY) t {CqQ + DquY )T 



+Y T Bl + ^B p Bl 
C Z Q + D ZU Y 
C q Q + D qu Y 




-Hi 



< 0. (7.23) 
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Remark: As before, we can eliminate the variable Y from this LMI. 



• Diagonal Norm-bound LDIs: Finally, in the case of DNLDIs, the output energy 
is bounded above by x T Q~ x xo, for any Q > 0, Y and M > and diagonal such that 



AQ + QA T + B U Y 

+Y T BT, + B v MBl 
C z 



(C Z Q + D ZU Y) T (C q Q + D qu YY J 



C„ 



D ZU Y 
D qu Y 








-M 



< 0. 



Given an initial condition, finding a state-feedback gain K so as to minimize the 
upper bound on the extractable energy for the various LDIs is therefore an EVP. We 
can extend these results to the case when xo is specified to lie in a polytope or an 
ellipsoid (see §6.2.1). If x is a random variable with Exo^o" = -^Oi EVPs that yield 
state-feedback gains that minimize the expected value of the output energy can be 
derived. 

7.5 Input-to-Output Properties 

We finally consider the problem of finding a state-feedback gain K so as to achieve de- 
sired properties between the exogenous input w and the output z for the system (7.1). 
As mentioned in Chapter 6, the list of problems considered here is far from exhaustive. 

7.5.1 L 2 and RMS gains 

We seek a state- feedback gain K such that the L2 gain 

'NI2 



sup ||Z|| 2 

IMI 2 =i 



sup 



IM| 2 /o l|W||2 
of the closed-loop system is less than a specified number 7. 

• LTI systems: As seen in 6.3.2, the L2 gain for LTI systems is equal to the Hoc 
norm of the corresponding transfer matrix. From §6.3.2, there exists a state-feedback 
gain K such that the L 2 gain of an LTI system is less than 7, if there exist K and 
Q > such that, 



(A + B U K)Q + Q{A + B U K) T 

+B W B W 

(C z + D ZU K)Q 



Q(C Z + D ZU K) T 

-1 2 I 
Introducing Y = KQ, this can be rewritten as 

AQ + QA T + B U Y + Y T B T U + B W B T W (C Z Q + D ZU Y) T 



< 0. 



(7.24) 



C Z Q + D ZU Y 



-1 2 I 



< 0. 



(7.25) 



In the case of LTI systems, this condition is necessary and sufficient. Assuming 
C^D ZU = and D^ U D ZU invertible, it is possible to simplify the inequality (7.25) 
and get the equivalent Riccati inequality 

AQ + QA T - BuiD^D^Bl + B W B T W + QCjC z Q/-/ 2 < 0. (7.26) 
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The corresponding Riccati equation is readily solved via Hamiltonian matrices. The 
resulting controller, with state-feedback gain K = — (D zu D zu )~ 1 B^, yields a closed- 
loop system with Hoo-norm less than 7. 

• Polytopic LDIs: From §6.3.2, there exists a state-feedback gain such that the 
L2 gain of a PLDI is less than 7 if there exist Y and Q > such that 



A t Q + QAf + B Uii Y 

+ YTB u,i + B w,i B w,i 



(C z ^Q + D zu , l Y) T 

-1 2 I 



< 0. 



(7.27) 



• Norm-bound LDIs: For NLDIs, the LMI that guarantees that the L 2 gain is less 
than 7 for some state-feedback gain K is 



AQ + QA T \ 

+B u Y + Y T Bl 
+B W B W + [iB p B p ) 
C Z Q + D ZU Y 


(C Z Q + D ZU Y) T 

-1 2 I 


(C q Q + D qu Y) 



C q Q + D qu Y 





-Hi 



< 0. 



(7.28) 



We can therefore find state-feedback gains that minimize the upper bound on the 
I/2 gain, provable with quadratic Lyapunov functions, for the various LDIs by solving 
EVPs. 



7.5.2 Dissipativity 

We next seek a state-feedback gain K such that the closed-loop system (7.2) is passive; 
more generally, assuming D zw (t) is nonzero and square, we wish to maximize the 
dissipativity, i.e., rj satisfying 



(w z — r/w wj dt > 



for all T > 0. 

• LTI systems: Substituting Y = KQ, and using LMI (6.60), we conclude that the 
dissipation of the LTI system is at least 77 if the LMI in rj, Q > and Y holds: 



AQ + QA T + B U Y + Y T B T U B w - QC T Z - Y T D T ZU 
Bl - C Z Q - D ZU Y 2 V I - (D zw + DjJ 



< 0. 



• Polytopic LDIs: From (6.62), there exists a state-feedback gain such that the 
dissipation exceeds rj if there exist Q > and Y such that 



A t Q + QA\ 



+B u , t Y + Y 1 B 



T U T 



Bn 



QCT Z -Y T D Z 



B w ; — C z ^i 



D zu>i Y 2 V I - (D z 



Di 



< 0. 
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• Norm-bound LDIs: From (6.64), there exists a state-feedback gain such that the 
dissipation exceeds r\ if there exist Q > and Y such that 



< 



We can therefore find state-feedback gains that maximize the lower bound on the 
dissipativity, provable with quadratic Lyapunov functions, for the various LDIs by 
solving EVPs. 

Remark: As with §6.3.4, we can incorporate scaling techniques into many of the 
results above to derive componentwise results. Since the new LMIs thus obtained 
are straightforward to derive, we will omit them here. 



AQ + QA T + B U Y \ 
+Y T B T + ^B q B^ ) 


( B w - QC T Z \ 
V ~Y T DJ U ) 


(C q Q + D qu Y) T 


( Bl - C Z Q \ 
V ~D ZU Y J 


-{D zw + D T zw -2 n I) 





C q Q + D qu Y 





-til 



7.5.3 Dynamic versus static state-feedback controllers 

A dynamic state-feedback controller has the form 



Ax- 



JDyX^ 



\_/ u X ~r J-Juy^i 



(7.29) 



where A e R rxr , B y e R rx ™, C u e R"" xr , D uy e R"" x ". The number r is called 
the order of the controller. Note that by taking r = 0, this dynamic state-feedback 
controller reduces to the state-feedback we have considered so far (which is called 
static state- feedback in this context). 

It might seem that the dynamic state-feedback controller (7.29) allows us to meet 
more specifications than can be met using a static state-feedback. For specifications 
based on quadratic Lyapunov functions, however, this is not the case. For exam- 
ple, suppose there does not exist a static state-feedback gain that yields closed-loop 
quadratic stability. In this case we might turn to the more general dynamic state- 
feedback. We could work out the more complicated LMIs that characterize quadratic 
stabilizability with dynamic state-feedback. We would find, however, that these more 
general LMIs are also infeasible. See the Notes and References. 



7.6 Observer-Based Controllers for Nonlinear Systems 



We consider the nonlinear system 
x = f(x) + B u u, 



y — C y x, 



(7.30) 

R p is the control variable, 
and y : R+ — ► R 9 is the measured or sensed variable. We assume the function 
/ : R™ -> R n satisfies /(0) = and 



where x : R 



+ 



R™ is the state variable, u : Rj 



Ox 



G Co{A 1 ,...,A M } 



where A\, . . . , Al are given, which is the same as (4.15). 

We look for a stabilizing observer-based controller of the form 

x = f(x) + B u u + L{C y x — y), u = Kx, (7-31) 
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i.e., we search for the matrices K (the estimated- state feedback gain) and L (the 
observer gain) such that the closed-loop system 



x 

X 



~ LlKjyX 



m 

-Six) 



B u Kx 
- (B U K 



J-/Lyy )X 



(7.32) 



is stable. The closed-loop system (7.32) is stable if it is quadratically stable, which is 
true if there exists a positive-definite matrix PgR" " such that for any nonzero 
trajectory x, x, we have: 



<0. 



In the Notes and References, we prove that this is true if there exist P, Q, Y, and W 
such that the LMIs 



d 


X 


1 

p 


X 


~dt 


X 


X 



> 0, A t Q + QAf + B U Y + Y T Bl < 0, 



1. 



and 



P > 0, AfP + PA, + WC V 



C^ W 1 K 0, 



l, 



M 



,M 



(7.33) 



(7.34) 



hold. To every P, Q, Y , and W satisfying these LMIs, there corresponds a stabilizing 
observer-based controller of the form (7.31), obtained by setting K = YQ~ X and 
L = P- X W. 

Using the elimination procedure of §2.6.2, we can obtain equivalent conditions in 
which the variables Y and W do not appear. These conditions are that some P > 
and Q > satisfy 



and 



AiQ + QAf <aB u Bl, i = l,...,M 



AjP + PAi<nC^C y , i = l,...,M 



(7.35) 



(7.36) 



for some a and fi. By homogeneity we can freely set a = /j, = 1. For any P > 0, Q > 
satisfying these LMIs with a = fi = 1, we can obtain a stabilizing observer-based 
controller of the form (7.31), by setting K = -{l/2)B^Q^ 1 and L = -{l/2)p- 1 C y . 
Another equivalent condition is that 

Bl (A t Q + QAf ) B u < 0, t = l,...,M 



and 



C y (AfP + PA,) C T y < 0, i = 1 



,M 



hold for some P > 0, Q > 0, where B u and C y are orthogonal complements of B u and 
Cy , respectively. If P and Q satisfy these LMIs, then they satisfy the LMIs (7.36) 
and (7.35) for some a and /!. The observer-based controller with K 
and L = —(fi/2)P~ 1 C y stabilizes the nonlinear system (7.30). 



-(<y/2)BlQ-^ 



Notes and References 



Lyapunov functions and state-feedback 

The extension of Lyapunov's methods to the state-feedback synthesis problem has a long 
history; see e.g., [Bar70a, Lef65]. It is clearly related to the theory of optimal control, in 
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which the natural Lyapunov function is the Bellman-Pontryagin "min-cost-to-go" or value 
function (see [Pon61, Pon62]). We suspect that the methods described in this chapter can 
be interpreted as a search for suboptimal controls in which the candidate value functions 
are restricted to a specific class, i.e., quadratic. In the general optimal control problem, 
the (exact) value function satisfies a partial differential equation and is hard to compute; 
by restricting our search to quadratic approximate value functions we end up with a low 
complexity task, i.e., a convex problem involving LMIs. 

Several approaches use Lyapunov-like functions to synthesize nonlinear state-feedback control 
laws. One example is the "Lyapunov Min-Max Controller" described in [Gut79. GP82, 
COR85, CL90, Wei94]. See also [Zin90, SC91, Rya88, GR88]. 

Quadratic stabilizability 

The term "quadratic stabilizability" seems to have been coined by Hollot and Barmish 
in [HB80], where the authors give necessary and sufficient conditions for it; see also [Lei79, 
Gut79, BCL83, PB84, COR.85]. Petersen [Pet85], shows that there exist LDIs that are 
quadratically stabilizable, though not via linear state-feedback; see also [SP94e]. Hol- 
lot [Hol87] describes conditions under which a quadratically stabilizable LDI has infinite 
stabilizability margin; see also [SRC93]. Wei [Wei90] derives necessary and sufficient "sign- 
pattern" conditions on the perturbations for the existence of a linear feedback for quadratic 
stability for a class of single-input uncertain linear dynamical systems. Petersen [Pet87b] 
derives a Riccati-based approach for stabilizing NLDIs, which is in fact the state-feedback 
Hoo equation (7.28) (this is shown in [Pet87a]). 

The change of variables Q — P _1 and Y — KP _1 , which enables the extension of many 
of the results of Chapters 5 and 6 to state-feedback synthesis, is due to Bernussou, Peres 
and Geromel [BPG89a]. See also [GPB91, BPG89B, BPG89A, PBG89, PGB93]. This 
change of variables, though not explicitly described, is used in the paper by Thorp and 
Barmish [TB81]; see also [HB80, Bar83, Bar85]. For a recent and broad review about 
quadratic stabilization of uncertain systems, see [COR94] . 

In [Son83], Sontag formulates a general theorem giving necessary and sufficient Lyapunov 
type conditions for stabilizability. For LTI systems his criteria reduce to the LMIs in this 
chapter. 

Quadratic Lyapunov functions for proving performance 

The addition of performance considerations in the analysis of LDIs can be traced back to 
1955 and even earlier, when Letov [Let61] studied the performance of unknown, nonlinear 
and possibly time- varying control systems and calls it "the problem of control quality" . At 
that time, the performance criteria were decay rate and output peak deviations for systems 
subject to bounded peak inputs (see [Let61, p. 234] for details). 

In the case of LTI systems, adding performance indices allows one to recover many classical 
results of automatic control. The problem of maximizing decay rate is discussed in great 
detail in [Yan92]; other references on decay rates and quadratic stability margins are [Gu92b, 
Gu92a, GPB91, Gu92b, Sko91b, PZP+92, EBFB92, Cor90]. Arzelier et al. [ABG93] 
consider the problem of robust stabilization with pole assignment for PLDIs via cutting-plane 
techniques. 

The problem of approximating reachable sets for LTI systems under various assumptions on 
the energy of the exogenous input has been considered for example by Skelton [SZ92] and 
references therein; see also the Notes and References of the previous chapter. See also [OC87]. 

The idea that the output variance for white noise inputs (and related LQR-like performance 
measures) can be minimized using LMIs can be found in [BH88A, BH89, RK91, KR91, 
KKR93, BB91, FBBE92, RK93, SI93]. Peres, Souza and Geromel [PSG92, PG93] con- 
sider the extension to PLDIs. The counterpart for NLDIs is in Petersen, McFarlane and 
Rotea [PM92, PMR93], as well as in Stoorvogel [Sto91]. 

The problem of finding a state-feedback gain to minimize the scaled L2 gain for LTI sys- 
tems is discussed in [EBFB92]. The L2 gain of PLDIs has been studied by Obradovic and 
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Valavani [OV92], by Peres, Geromel and Souza [PGS91] and also by Ohara, Masubuchi and 
Suda [OMS93]. NLDIs have been studied in this context by Petersen [Pet89], and DeSouza 
et al. [XFdS92, XdS92, WXdS92, DSFX93], who point out explicitly that quadratic sta- 
bility of NLDIs with an L2 gain bound is equivalent to the scaled Hoc condition (6.54); see 
also [ZKSN92, Gu93]. For LTI systems, it is interesting to compare the inequality (7.25) 
which provides a necessary and sufficient condition for a system to have L2 gain less than 
7 with the corresponding matrix inequality found in the article by Stoorvogel and Trentel- 
man [ST90]. In particular, the quadratic matrix inequality found there (expressed in our 
notation) is 



A T P + PA + j 2 PB w BlP + C z r C z PB u +CjD z , L 
BlP + D T ZU C Z D T ZU D ZU 



> 0, 



which does not possess any obvious convexity property. We note that the Riccati inequal- 
ity (7.26) is also encountered in full-state feedback H™ control [Pet87a]. Its occurrence in 
control theory is much older; it appears verbatim in some articles on the theory of nonzero- 
sum differential games; see for example Starr and Ho [SH68], Yakubovich [Yak70, Yak71] 
and Mageirou [Mag76]. 

Kokotovic [Kok92] considers stabilization of nonlinear systems, and provides a motivation for 
rendering an LTI system passive via state-feedback (see §6.3.3). In [PP92], the authors give 
analytic conditions for making an LTI system passive via state-feedback; see also [SKS93]. 

Finally, we mention an instance of a Lyapunov function with a built-in performance criterion. 
In the book by Aubin and Cellina [AC84, §6], we find: 

We shall investigate whether differential inclusions x e F(x(t)), x(0) = xo 
have trajectories satisfying the property 



Vt>s, V(x(t))-V(x(s)) + W{x(T),x(r))dr<0. (7.37) 

J s 

We shall then say that a function V [■ ■ ■] satisfying this condition is a Lyapunov 
function for F with respect to W . 

Similar ideas can be found in [CP72]. We shall encounter such Lyapunov functions in §8.2. 
Inequality (7.37) is often called a "dissipation inequality" [Wil71b]. 

The problem of minimizing the control effort given a performance bound on the closed-loop 
system can be found in the articles of Schomig, Sznaier and Ly [SSL93], and Grigoriadis, 
Carpenter, Zhu and Skelton [GCZS93]. 

LMI formulation of LQR problem 

For the LTI system x — Ax + B u u, z = C z x + D zu u, the Linear- Quadratic Regulator (LQR) 
problem is: Given an initial condition x(0), find the control input u that minimizes the output 
energy J°° z T z dt. We assume for simplicity that (A,B,C) is minimal, Dj n D zu is invertible 
and D T ZU C Z = 0. 

It turns out that the optimal input u can be expressed as a constant state-feedback u — Kx, 
where K — —(D ZU D ZU )~ B u P are and P are is the unique positive-definite matrix that satisfies 
the algebraic Riccati equation 

A T P SiTe + P„ e A - P are S„(DLc u )" 1 srPare + CjC z = 0. (7.38) 

The optimal output energy for an initial condition x(0) is then given by x(0) T P a r e a;(0). With 
Qare = P^re i we can rewrite (7.38) as 

AQ are + Q^A T - B^Dl^D^Bl + Q^CjC.Q^e = 0, (7.39) 
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and the optimal output energy as x(0) T Q ar e2:(0). 

In section §7.4.1, we considered the closely related problem of finding a state- feedback gain 
K that minimizes an upper bound on the energy of the output, given an initial condition. 
For LTI systems, the upper bound equals the output energy, and we showed that in this case, 
the minimum output energy was given by minimizing x(0) Q~ x(0) subject to Q > and 



AQ + QA T + B U Y + Y T Bl 
C Z Q + D ZU Y 



IP, 



+ D ZU Y) T 
-I 



< 0. 



(7.40) 



Of course, the optimal value of this EVP must equal the optimal output energy given via 
the solution to the Riccati solution (7.39); the Riccati equation can be thus be interpreted as 
yielding an analytic solution to the EVP. We can derive this analytic solution for the EVP via 
the following steps. First, it can be shown, using a simple completion-of-squares argument 



that LMI (7.40) holds for some Q > and Y if and only if it holds for Y = -(D'i 
in this case, Q > must satisfy 



,Dz 



~ x Bl 



QA T +AQ + QCfC z Q - B u (D T ZU D Z 



(7.41) 



Next, it can be shown by standard manipulations that if Q > satisfies (7.41), then Q < Q a re- 
Therefore for every initial condition x(0), we have x(0) T Q~ T \x(0) < x(0) T Q~ 1 x(0), and 
therefore the optimal value of the EVP is just x(0) T Q^ r ],x(0), and the optimal state-feedback 
gain is K opt = Y opt Q- r l = -(D^D 211 )- 1 BjP are . 

It is interesting to note that with P — Q _1 , the EVP is equivalent to minimizing x(0) T Px(0) 
subject to 



A T P + PA + C T Z C Z - PB U (Dj u D zu ) X BZP < 0, 



(7.42) 



which is not a convex constraint in P. However, the problem of maximizing x(0) Px(0) 
subject P > and the constraint 

A T P + PA- PB u (D T zu D zv )- x BlP + C z Cj > 0, 

which is nothing other than (7.42), but with the inequality reversed, is an EVP; it is well- 
known (see for example [Wil71b]) that this EVP is another formulation of the LQR problem. 



Static versus dynamic state-feedback 

An LTI system can be stabilized using dynamic state-feedback if and only if it can be stabi- 
lized using static state-feedback (see for example, [Kai80]). In fact, similar statements can 
be made for all the properties and for all the LDIs considered in this chapter, provided the 
properties are specified using quadratic Lyapunov functions. We will demonstrate this fact 
on one simple problem considered in this chapter. 

Consider the LTI system 



x = Ax + B u u + B w w, x{0) — 0, 



z = C z x. 



(7.43) 



From §7.5.1, the exists a static state-feedback u = Kx such that the L2 gain from w to z 
does not exceed 7 if the LMI in variables Q > and Y is feasible: 



AQ + QA T + B U Y + Y T BZ + B W BZ QCj 
C Z Q - 7 2 / 



< 0. 
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(In fact this condition is also necessary, but this is irrelevant to the current discussion. 
Eliminating Y, we get the equivalent LMI in Q > and scalar a: 



A Q + QA T + B w Bl + QC % aQ < oB u Bl. 



(7.44) 



We next consider a dynamic state-feedback for the system: 
x c — A c x c + B c x, x c (0) — 0, u — C c x c + D c x. 



(7.45) 



We will show that there exist matrices A c , B c , C c and D c such that the L2 gain of the 
system (7.43,7.45) does not exceed 7 if and only if the LMI (7.44) holds. This will establish 
that the smallest upper bound on the L2 gain, provable via quadratic Lyapunov functions, 
is the same irrespective of whether a static state-feedback or a dynamic state-feedback is 
employed. 

We will show only the part that is not obvious. With state vector x c \ — [x T x^] T , the 
closed-loop system is 

Xci = A c \Xc\ + Bciu, a; c i(0) = 0, z — CciX c i, 

where A c i — Abi g + -Bbig-^big with 



^bi g 



B c 



A 



, B big = 




I 


B, 




B w 




1 


Cci = [C z 0] . 





^big 



B c A c 
D c C c 



From §6.3.2, the L2 gain of the closed-loop system does not exceed 7 if the LMI in Pbig > 
is feasible: 



A£ ig P + P hig A + PB hig K hig + Kl ig Bl ig P + CjiCd PB rA 
BlP - 7 2 / 



< 0. 



Eliminating Khi g from this LMI yields two equivalent LMIs in Pbig > and a scalar r. 



^bi g F + Pbig.4 -tI + ClCd PB c i 

BlP -l 2 I _ 


<o, 


Al g P + P big A - TPB hig Bl- lg P + ClC A PB C] 


BlP 


- 7 2 /_ 



< 0. 



It is easy to verify that the first LMI holds for large enough r, while the second reduces to 
r > and 

AQu + QnA T + ^L^L + QnaCjQn < tB u bI, 

T 

where Qn is the leading n x n block of P big - The last LMI is precisely (7.44) with the change 
of variables Q — 7 Qn and a — 7 t. 

A similar result is established [PZP + 92]; other references that discuss the question of when 
dynamic or nonlinear feedback does better than static feedback include Petersen, Khar- 
gonekar and Rotea [Pet85, KPR88, RK88, KR88, Pet88, RK89]. 
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Dynamic output-feedback for LDIs 

A natural extension to the results of this chapter is the synthesis of output-feedback to meet 
various performance specifications. We suspect, however, that output-feedback synthesis 
problems have high complexity and are therefore unlikely to be recast as LMI problems. 
Several researchers have considered such problems: Steinberg and Corless [SC85], Galimidi 
and Barmish [GB86], and Geromel, Peres, de Souza and Skelton [PGS93]; see also [HCM93]. 

One variation of this problem is the "reduced-order controller problem" , that is, finding dy- 
namic output-feedback controllers with the smallest possible number of states. It is shown 
in [PZPB91, EG93, PAC94] that this problem can be reduced to the problem of minimizing 
the rank of a matrix P > subject to certain LMIs. The general problem of minimizing the 
rank of a matrix subject to an LMI constraint has high complexity; in fact, a special rank 
minimization problem can be shown to be equivalent to the NP-hard zero-one linear pro- 
gramming problem (see e.g. [Dav94]). Nevertheless several researchers have tried heuristic, 
local algorithms for such problems and report practical success. Others studying this topic 
include Iwasaki, Skelton, Geromel and de Souza [IS93B, ISG93, GdSS93, SIG93]. 

Observer-based controllers for nonlinear systems 

We prove that the LMIs (7.33) and (7.34) (in the variables P > 0, Q > 0, W and Y) ensure 
the existence of an observer-based controller of the form (7.31) which makes the system (7.30) 
quadratically stable. 

We start by rewriting the system (7.32) in the coordinates x, x — x: 



x 
x — x 



LC y (x -x) + f(x) + B u Kx 

f(x)-f(x) + LC y (x-x) 



(7-46) 



Using the results of §4.3, we can write 



x 
X — X 



€ Co {Aij | 1 < i < M, 1 < j < M) 



x 
X — X 



with 



An = 



At + B U K 




LCy 



i,j - 1, 



,M. 



Simple state-space manipulations show the system (7.32) is quadratically stable if and only if 
the system (7.46) is quadratically stable. Therefore, the system (7.32) is quadratically stable 
if there exists P > such that 



AijP + PAij <Q, M = l, 



,M. 



(7.47) 



We will now show that the LMI (7.47) has a solution if and only if the LMIs (7.33) and (7.34) 
do. 

First suppose that (7.47) has a solution P. With P partitioned as, n x n blocks, 



P = 



Pn P\i 
P\i Pii 



we easily check that the inequality (7.47) implies 

(A l + B U K) T P 11 +P 11 (A l + B u K) <0, i = l,...,M. (7.48) 
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With the change of variables Q — P lx x and Y — B U Q, the inequality (7.48) is equivalent 

to (7.33). 

Define Q = P _1 . The inequality (7.47) becomes 

AijQ + QAjjKO, i,j = l,...,M. 
Denoting by Q 22 the lower-right n x n block of Q, we see that this inequality implies 

(A l + LC y )Q 22 + Q 22 {A l + LC y ) T < 0, i=l,...,M. (7.49) 

With P - Q~z and W = PL, the inequality (7.49) is equivalent to (7.34). 

Conversely, suppose that P, Q, Y and W satisfy the LMIs (7.34) and (7.33) and define 
L — P~ 1 W , K — YQ~ l . We now prove there exists a positive A such that 

XQ- 1 
P 

satisfies (7.47) and therefore proves the closed-loop system (7.32) is quadratically stable. We 
compute 



AijP + PAij 



(At + B U K) J Q- 1 
+ Q~ 1 {A, + B U K) 

XiLCyfQ- 1 



XQ-'LCy 

(A 3 +LCy) T P 

+ P(A j+ LCy) 



for i,j — 1,...,M. Therefore, using Schur complements, the closed-loop system (7.32) is 
quadratically stable if A > satisfies 

A(Q- 1 LC„((A J + LC y ) T P + P{Aj + LC* y ))- 1 (LC !/ ) T Q- 1 ) 

-(A + BuKfQ- 1 -Q-^A + BuK) > 

for i,j = 1, . . . ,M. This condition is satisfied for any A > such that 

A min /!, > max Vi, 

l<j<M Ki<M 



where 



Hi = A min (Q" 1 LC7 1/ ((^ + LCyfP + P{A 3 + LCy))- 1 (LC y fQ- 1 ), j = l,...,M 



and 



Vi = A max ((A 1 + BuKfQ- 1 + Q~ 1 (A l + B U K)), i = l,...,M. 

Since (7.34) and (7.33) are satisfied, such a A always exists. 

Problems of observer-based controller design along with quadratic Lyapunov functions that 
prove stability have been considered by Bernstein and Haddad [BH93] and Yaz [Yaz93]. 

Gain-scheduled or parameter-dependent controllers 

Several researchers have extended the results on state-feedback synthesis to handle the case 
in which the controller parameters can depend on system parameters. Such controllers are 
called gain-scheduled or parameter-dependent. We refer the reader to the articles cited for 
precise explanations of what these control laws are, and which design problems can be recast 
as LMI problems. Lu, Zhou and Doyle in [LZD91] and Becker and Packard [Bec93, BP91] 
consider the problem in the context of NLDIs. Other relevant references include [PB92, 
PAC94, PBPB93, BPPB93, GA94, IS93A, IS93b, AGB94]. 
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Lur'e and Multiplier Methods 



8.1 Analysis of Lur'e Systems 

We consider the Lur'e system 

x = Ax + B p p+ B w w, q = C q x, 

z = C z x, pi(t) = (j>i{qi(t)), i = l,...,n p , 

where p(t) G R™ p , and the functions (pi satisfy the [0, 1] sector conditions 

< a<f>i(a) < a 2 for all ct G R, (8.2) 

or, equivalently, 

(f>i((j)(4>i(cr) - a) < for all a G R. 

The data in this problem are the matrices A, B p , B w , C q and C z . The results that 
follow will hold for any nonlinearities (pi satisfying the sector conditions. In cases 
where the (pi are known, however, the results can be sharpened. 
It is possible to handle the more general sector conditions 

otiO < <j(pi((j) < Pi(T for all a G R, 

where a, and (3i are given. Such systems are readily transformed to the form given 
in (8.1) and (8.2) by a loop transformation (see the Notes and References). 

An important special case of (8.1), (8.2) occurs when the functions (pi are linear, 
i.e., (pi(o~) = 5iO~, where Si € [0, 1]. In the terminology of control theory, this is referred 
to as a system with unknown-but-constant parameters. It is important to distinguish 
this case from the PLDI obtained with (8.1), (pi(cr) = 5i(t)o~ and 6i(t) G [0,1], which 
is referred to as a system with unknown, time-varying parameters. 

Our analysis will be based on Lyapunov functions of the form 

F(0 = ^ T ^ + 2^A, / <Pi(*)do-, (8.3) 

where Ci_ q denotes the ith row of C q . Thus the data describing the Lyapunov function 
are the matrix P and the scalars Xi, i = 1, . . . ,n p . We require P > and Aj > 0, 
which implies that V(£) > £ T P(, > for nonzero £. 

Note that the Lyapunov function (8.3) depends on the (not fully specified) non- 
linearities (pi . Thus the data P and Xi should be thought of as providing a recipe for 
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constructing a specific Lyapunov function given the nonlinearities fa. As an exam- 
ple, consider the special case of a system with unknown-but-constant parameters, i.e., 
4>i(<r) = 5i<J. The Lyapunov function will then have the form 

where A = diag(<5i, . . . , S n ) and A = diag(Ai, . . . , X n ). In other words, we are really 
synthesizing a parameter-dependent quadratic Lyapunov function for our parameter- 
dependent system. 

8.1.1 Stability 

We set the exogenous input w to zero, and seek P and Xi such that 
dV(x) 
dt 
Since 



< for all nonzero x satisfying (8.1) and (8.2). (8-4) 



^M = 2 I X -P + ^ X iPiCi , q I (Ax + B pP ) , 

condition (8.4) holds if and only if 
/ n p \ 

U T P+Y1 X i*i C i,1 I (M + BpTT) < 

for all nonzero £ satisfying 

n(^i - (H,qO < 0, i = l,...,n p . (8.5) 

It is easily shown that 

{(£,tt)|£^0, (8.5)} = {(£,tt) |^0or7r^0, (8.5)}. 

The 5-procedure then yields the following sufficient condition for (8.4): The LMI in 
P > 0, A = diag(Ai, . . . , A„J > and T = diag(n, . . . ,t„) > 



A T P + PA PB p + A T C q r A + CjT 

BTp + AC q A + TC q AC q B p + BjCjA - 2T 

holds. 

Remark: When we set A = we obtain the LMI 



< (8.6) 



A T P + PA PB P + CqT 
BlP + TC„ -2T 



<0, 



which can be interpreted as a condition for the existence of a quadratic Lyapunov 
function for the Lur'e system, as follows. Perform a loop transformation on the 
Lur'e system to bring the nonlinearities to sector [—1,1]. Lhen the LMI above 
is the same (to within a scaling of the variable T) as one that yields quadratic 
stability of the associated DNLDI; see §5.1. 



Remark: Condition (8.6) is only a sufficient condition for the existence of a Lur'e 
Lyapunov function that proves stability of system (8.1). It is also necessary when 
there is only one nonlinearity, i.e., when n p — 1. 
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8.1.2 Reachable sets with unit-energy inputs 

We consider the set reachable with unit-energy inputs, 

x, w satisfy (8.3), x(0) = 



K uc = { x(T) 
The set 

-I 

contains TZ, U . if 



w I wdt<l, T>0 



e T pe + 2^A, / 

,_-, Jo 



Ci, q £. 



<f>i(<r) da < 1 



d 

dl, 



V(x) < w w for all x and w satisfying (8.1) and (8.2). 



(8.7) 



From the <S-procedure, the condition (8.7) holds if there exists T = diag(ri, . . . , r n ) > 
such that 



A T P + PA PB P + A T C^A + C?T PB W 

E.Tp + AC q A + TC q AC q B p + B^C q r A-2T AC q B u 
BlP BlC T q A -I 



< 



holds, where A = diag(Ai, . . . ,X n ) > 0. 

Of course, the ellipsoid £ = {£ € R n | £, T P(, < 1 } contains T, so that £ D 
F 5 ?? U c- Therefore, £ gives an outer approximation of TZ UC if the nonlinearities 
are not known. If the nonlinearities fa are known, T gives a possibly better outer 
approximation. We can use these results to prove that a point xq does not belong to 
the reachable set by solving appropriate LMIPs. 

8.1.3 Output energy bounds 

We consider the system (8.1) with initial condition x(0), and compute upper bounds 
on the output energy 



J 



z z dt 



using Lyapunov functions V of the form (8.3). 
If 



dl. 



V(x) + z T z < for all x satisfying (8.1), 



(8.9) 



then J < V(x(0)). The condition (8.9) is equivalent to 

(n p \ 

(T P + J2 XiWiC itg J (A£ + B p tt) + ecJCX < 0, 

for every £ satisfying 

7Ti(7Ti - Ci, q £) < 0, i=l,...,Tlp. 
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Using the <S-procedure, we conclude that the condition (8.9) holds if there exists 
T = diag(ri, . . . , r n ) > such that 



A t p + pa + C T Z C Z PB P + A T C T q A + C T q T 
BlP + AC a A + TC AC a B v + SjCjA - IT 



< 0. (8.10) 



Since V(x(0j) < x(0) T (P + Cj AC q )x(0), an upper bound on J is obtained 
by solving the following EVP in the variables P, A = diag(Ai, . . . , A n ) and T = 
diag(n,...,r„J: 



minimize 



x(0) T (P+CjAC q )x(0) 



subject to (8.10), T>0, A>0, P>0 

If the nonlinearities </>j are known, we can obtain possibly better bounds on J by 
modifying the objective in the EVP appropriately. 

8.1.4 L 2 gain 

We assume that D zw = for simplicity. If there exists a Lyapunov function of the 
form (8.3) and 7 > such that 

— V(x) < 7 w w — z z for all x and w satisfying (8.1), (8-H) 

at 

then the L2 gain of the system (8.1) does not exceed 7. The condition (8.11) is 
equivalent to 

/ n p \ 

2 U T P + J2 AiTTjCi,, I (M + B p tt) < 1 2 w T w - i T C T z C z i 

for any £ satisfying 71^(7^ — Cj )g £) < 0, i = 1, . . . , n p . Using the 5-procedure, this is 
satisfied if there exists T = diag(ri, . . . , r n ) > such that 



A T P +PA+ CjC z PB P + A T C?A + C^T PB W 
BTp + AC q A + TC q AC q B p + B p r C q r A-2T AC q B w 
BlP BlC^A -7 2 / 



< 0. (8.12) 



The smallest upper bound on the L2 gain, provable using Lur'e Lyapunov functions, 
is therefore obtained by minimizing 7 over 7, P, A and T subject to P > 0, A = 
diag(Ai,...,A„ p ) > 0, T = diag(n, . . . ,r„ p ) > and (8.12). This is an EVP. 

8.2 Integral Quadratic Constraints 

In this section we consider an important variation on the NLDI, in which the point- 
wise constraint p(t) T p(t) < q(t) T q(t) is replaced with a constraint on the integrals of 
p(t) T p(t) and q(t) T q(t): 

x = Ax + B p p + B u u + B w w, 

q = C q x + D qp p + D qu u + D qw w, 

z = C z x + D zp p+D zu u+D zw w ( 8 - 13 ) 

t r t 

T,_\ j_ ^ / / nT. 



p(r) p(r) dr < I q(r) q(r) dr. 
./o 
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Such a system is closely related to the NLDI with the same data matrices; indeed, 
every trajectory of the associated NLDI is a trajectory of (8.13). This system is 
not, however, a differential inclusion. In control theory terms, the system (8.13) is 
described as a linear system with (dynamic) nonexpansive feedback. An important 
example is when p and q are related by a linear system, i.e., 

±f = AfXf + Bjq, p=CfXf + Dfq, Xf(0) = 0, 

where \\D f + C f (sl - A^BfW^ < 1. 

We can also consider a generalization of the DNLDI, in which we have compo- 
nentwise integral quadratic constraints. Moreover, we can consider integral quadratic 
constraints with different sector bounds; as an example we will encounter constraints 
of the form L p(t) t q{r) dr > in §8.3. 

We will now show that many of the results on NLDIs (and DNLDIs) from Chap- 
ters 5-7 generalize to systems with integral quadratic constraints. We will demon- 
strate this generalization for stability analysis and L2 gain bounds, leaving others to 
the reader. 

As in Chapters 5-7, our analysis will be based on quadratic functions of the state 
V(£) = £, T P£, and roughly speaking, the very same LMIs will arise. However, the 
interpretation of V is different here. For example, in stability analysis, the V of 
Chapters 5-7 decreases monotonically to zero. Here, the very same V decreases to 
zero, but not necessarily monotonically. Thus, V is not a Lyapunov function in the 
conventional sense. 

8.2.1 Stability 

Consider system (8.13) without w and z. Suppose P > and A > are such that 

x T Px < A (p T p - q T q) , (8.14) 



or 



dt 

d 
dl. 



(x T Px+X J {q{r) T q{r) - p{r) T p{r)) drj < 0. 



Note that the second term is always nonnegative. Using standard arguments from 
Lyapunov theory, it can be shown that lim^^oo x(t) = 0, or the system is stable. 

Now, let us examine the condition (8.14). It is exactly the same as the condition 
obtained by applying the <S-procedure to the condition 

— X Px < 0, whenever p p < q q, 
dt 

which in turn, leads to the LMI condition for quadratic stability of NLDI (5.3) 
P > 0, A > 0, 



A T P +PA+ \C T q C q PB p + \C T q D qv 
(PB p + \CTD qp f -\{I - D T qp D qp ) 



<0. 



8.2.2 L 2 gain 

We assume that x(0) = and, for simplicity, that D zw = and D qp = 0. 
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Suppose P > and A > are such that 
—x T Px < A (p T p - q T q) + ^ 2 w T w - z T z. (8.15) 

Integrating both sides from to T, 

x{T) T Px{T) + A / (q(T) T q{T) - p(t) t P (t)) dr < f (-j 2 w T w - z T z) dt. 
Jo Jo 

This implies that the L2 gain from w to z for system (8.13) does not exceed 7. 
Condition (8.15) leads to the same LMI (6.55) which guarantees that the L 2 gain 
of the NLDI (4.9) from ictoz does not exceed 7. 

Remark: Almost all the results described in Chapters 5, 6 and 7 extend imme- 
diately to systems with integral quadratic constraints. The only exceptions are 
the results on coordinate transformations (§5.1.1, also §7.2.2), and on reachable 
sets with unit-peak inputs (§6.1.3 and §7.3.3). 



8.3 Multipliers for Systems with Unknown Parameters 

We consider the system 

x = Ax + B p p + B w w, q = C q x + D qp p, 

Pi = 5iqi, i = l,...,n p , z = C z x + D zw w, 

where p(t) G R™ p , and Si, i = 1, . . . , n p are any nonnegative numbers. We can consider 
the more general case when on < Si < Pi using a loop transformation (see the Notes 
and References). 

For reasons that will become clear shortly, we begin by defining n p LTI systems 
with input qi and output q m ,i, where q m ,i(t) € R: 

(8.1 t) 

where we assume that A m ^ is stable and (A mt i, B m ^) is controllable. We further 
assume that 

t 
Qi(T)q m ^(T) dr > for all t > 0. 



This last condition is equivalent to passivity of the LTI systems (8.17), which in turn 
is equivalent to the existence of Pi > 0, i = 1, . . . , n p , satisfying 



A T P. -j- P. A PR — C T 

m i 1 ^ «^ L m,« ± % ±J w\,i ^-^m i 



"m,i*i ~ C m ,i — \J-Jm,i + LJ 

(See §6.3.3.) 



m..ij 



< 0, i = l,...,n p . (8.18) 



Thus, q and q m = [q m ,i, . . . , q mn ] are the input and output respectively of the 
diagonal system with realization 

A m = diag(A m s), B m = diag(B mA ) , C m = diag(C m ,j), D m = diag(D m j), 
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Defining 
A 



B m C q A r 



Br, 



Bp 

^m^qp 



B,, 



B w 




Cq — [±J m Cq G m J, 



D„ 



D m D qp , C z = [C z 0]. 



q — l J -'m^~ y q ^m\t QP 

consider the following system with integral quadratic constraints: 
dx r_ 

— - = Ax + B p p + B w w, 
at 

Qm = (->q x + L)qpP, 

Z = C z £ + b qp p + D ZW W, 



Pi(T)q m ,i{T)dT>0, i = l,...,n p . 



(8.19) 



It is easy to show that if x is a trajectory of (8.16), then [x T x'j n } T is a trajectory 
of (8.19) for some appropriate x m . Therefore, conclusions about (8.16) have implica- 
tions for (8.19). For instance, if system (8.19) is stable, so is system (8.16). 



8.3.1 Stability 

Consider system (8.19) without w and z. Suppose P > is such that 
d 



dl 



Px + 2y^ j p i q mti < 0, 



(8.20) 



dl 



Z,T TDZ, 



n P ft 

Px + 2J2 

,_-, Jo 



Pi{T)q m ,i{ T ) dT \ < °- 



Then, it can be shown using standard arguments that Irnit^^ x(t) = 0, i.e., the 
system (8.19) is stable. 

Condition (8.20) is just the LMI in P > 0, C m and D m : 



\T 1 
IT ; 



A T P + PA PB p + C T q 



B$P+C q D qp + D T qp 



<0. 



.21) 



Remark: LMI (8.21) can also be interpreted as a positive dissipation con- 
dition for the LTI system with transfer matrix G(s) — —W(s)H(s), where 
W(s) = C m {sI-A m )- 1 B m +D m and if (s) = C 9 (sl - A)~ 1 B P + D qp . Therefore, 
we conclude that system (8.16) is stable if there exists a passive W such that 
— W(s)H(s) has positive dissipation. Hence W is often called a "multiplier". See 
the Notes and References for details. 



8.3.2 Reachable sets with unit-energy inputs 

The set of reachable states with unit-energy inputs for the system 



B w w, 



q = C q x + D q pp, 



x = Ax + B p p - 
p = oq, 5 > 0, 
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ls 



n u 



x(T) 



x, w satisfy (8.22), x(0) = 

,-T 

w T wdt < 1, T > 



We can obtain a sufficient condition for a point xq to lie outside the reachable set TZ V 
using quadratic functions for the augmented system. Suppose P > is such that 



dt 



-T 
-X 



Px + z^2piq m ,i < 



T 
W W, 



.23) 



or 



P1 + 2V/ Pi{T)q m ,i{ T ) dT < / w{T) T w{r)dT. 
„•_-, Jo Jo 



then the set £ = {x \ x T Px < 1 } contains the reachable set for the system (8.19). 
Condition (8.23) is equivalent to the LMI in P > 0, C m and D m : 



A T P + PA PB W PBp + C? 

B T W P -I 

B T p P + C q D qp + D T qp 



<0. 



.24) 



Now, the point xq lies outside the set of reachable states for the system (8.22) if 
there exists P satisfying (8.23) and there exists no z such that 

X 



Xq 
Z 



P 



< 1. 



Partitioning P conformally with the sizes of xq and z as 

Pn P12 
P 12 P11 



P 
we note that 



Xq 
Z 



Pll P\1 
P\1 Pll 



XQ 

Z 



x"S(Pu - PwP^P&xo 



Therefore, xq does not belong to the reachable set for the system (8.22) if there exist 
P > 0, P t > 0, A > 0, C m , and D M satisfying (8.18), (8.24) and 



X^PhXq - 1 X^Pi2 

P 12 x P22 

This is an LMIP. 



>0. 



Notes and References 



Lur'e systems 

The Notes and References of Chapters 5 and 7 are relevant to this chapter as well. Lur'e 
and Postnikov were the first to propose Lyapunov functions consisting of a quadratic form 
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plus the integral of the nonlinearity [LP44]. The book [Lur.57] followed this article, in which 
the author showed how to solve the problem of finding such Lyapunov functions analyti- 
cally for systems of order 2 or 3. Another early book that uses such Lyapunov functions is 
Letov [Let61]. Work on this topic was continued by Popov [Pop62], Yakubovich [Yak64]. 
Tsypkin [Tsy64d, Tsy64c, Tsy64a, Tsy64b], Szego [Sze63] and Meyer [Mey66]. Jury 
and Lee [JL65] consider the case of Lur'e systems with multiple nonlinearities and derive a 
corresponding stability criterion. See also [IR64]. 

The ellipsoid method is used to construct Lyapunov functions for the Lur'e stability problem 
in the papers by Pyatnitskii and Skorodinskii [PS82, PS83]. See also [Sko91a, Sko91b] for 
general numerical methods to prove stability of nonlinear systems via the Popov or circle 
criterion. A precursor is the paper by Karmarkar and Siljak [KS75], who used a locally con- 
vergent algorithm to determine margins for a Lur'e-Postnikov system with one nonlinearity. 

Indeed, in the early books by Lur'e, Postnikov, and Letov the problem of constructing Lya- 
punov functions reduces to the satisfaction of some inequalities. In some special cases they 
even point out geometrical properties of the regions defined by the inequalities, i.e., that they 
form a parallelogram. But as far as we know, convexity of the regions is not noted in the 
early work. In any case, it was not known in the 1940's and 1950's that solution of convex 
inequalities is practically and theoretically tractable. 

Construction of Lyapunov functions for systems with integral quadratic constraints 

We saw in §8.2.1 that the quadratic positive function V(£) — £, T P(, is not necessarily a 
Lyapunov function in the conventional sense for the system (8.13) with integral quadratic 
constraints; although V(x(t)) — » as t — » oo, it does not do so monotonically. We now show 
how we can explicitly construct a Lyapunov function given P, in cases when p and q are 
related by 

if - f(Xf,q,t), p = g(x f ,q,t), (8.25) 

where Xf(t) £ R nj! . 

Since J p T pdr < J q T qdr, it can be shown (see for example [Wil72]) that Vf : R n/ — > R+ 
given by 

V>(£/) = sup I - / (q T q - p T p) dt x f (0) = £j, x f satisfies (8.25), T > I 

is a Lyapunov function that proves stability of system (8.25). Moreover, the Lyapunov 
function V : R n x R n f — » R + given by 

v(Z,tf) = z T Pt + v f (tf) 

proves stability of the interconnection 

x — Ax + B p p, q — G\x + D qp p, 

if - f( x f,q,~t), V — 9(.x f ,q,t). 

Note that this construction also shows that £ Pt; proves the stability of the NLDI (5.2). 

Integral quadratic constraints have recently received much attention, especially in the former 
Soviet Union. See for example [Yak88, Meg92a, Meg92b, Yak92, Sav91, SP94b, SP94h, 
SP94d]. 

Systems with constant, unknown parameters 

Problems involving system (8.16) are widely studied in the field of robust control, sometimes 
under the name "real-/i problems"; see Siljak [Sil89] for a survey. The results of §8.3 were 
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derived by Fan, Tits and Doyle in [FTD91] using an alternate approach, involving the ap- 
plication of an off-axis circle criterion (see also §3.3). We also mention [TV91], where Tesi 
and Vicino study a hybrid system, consisting of a Lur'e system with unknown parameters. 

There are elegant analytic solutions for some very special problems involving parameter- 
dependent linear systems, e.g., Kharitonov's theorem [Kha78] and its extensions [BHL89]. 
A nice survey of these results can be found in Barmish [Bar.93]. 

Complexity of stabilization problems 

The stability (and performance) analysis problems considered in §8.3 have high complexity. 
For example, checking if (8.16) is stable is NP-hard (see Coxson and Demarco [CD91], Braatz 
and Young [BYDM93], Poljak and Rohn [PR94], and Nemirovskii [Nem94]). We conjecture 
that checking whether a general DNLDI is stable is also NP-hard. 

Likewise, many stabilization problems for uncertain systems are NP-hard. For instance, the 
problem of checking whether system (8.16) is stabilizable by a constant state- feedback can 
be shown to be NP-hard using the method of [Nem94]. 

A related result, due to Blondel and Gevers [BG94], states that checking whether there 
exists a common stabilizing LTI controller for three LTI systems is undecidable. In contrast, 
checking whether there exists a static, output-feedback control law for a single LTI system 
is rationally decidable [ABJ75]. 

Multiplier methods 

See [Wil69a, NT73, DV75, Sil69, Wil70, Wil76] for discussion of and bibliography on 
multiplier theory and its connections to Lyapunov stability theory. In general multiplier 
theory we consider the system 

x — Ax + B p p + B w w, q — C q x + D qp p, p(t) = A(q,t). 

The method involves checking that H m = W [C q (sl — A) -1 B v + D qp ) satisfies 
H m (ju>) + H m (ju))* > for every u) e R, 

for some choice of W from a set that is determined by the properties of A. When A is 
a nonlinear time-invariant memoryless operator satisfying a sector condition (i.e., when we 
have a Lur'e system), the multipliers W are of the form (1 + qs) for some q > 0. This is 
the famous Popov criterion [Pop62, Pop64]; we must also cite Yakubovich, who showed, 
using the 5-procedure [Yak77], that feasibility of LMI (8.6) is equivalent to the existence of 
q nonnegative such that (1 + qs)(c q (sl — A)~ b p ) + 1/fc has positive dissipation. 

In the case when A(q,t) = 5q(t) for some unknown real number S, the multiplier W can be 
any passive transfer matrix. This observation was made by Brockett and Willems in [BW65]; 
given a transfer function H qp they show that the transfer function H qp / '(l + kH qp ) is stable for 
all values of k in (0, oo) if and only if there exists a passive multiplier W such that WH qp is 
also passive. This case has also been considered in [CS92b, SC93, SL93A, SLC94] where the 
authors devise appropriate multipliers for constant real uncertainties. The paper [BHPD94] 
by Balakrishnan et al., shows how various stability tests for uncertain systems can be red- 
erived in the context of multiplier theory, and how these tests can be reduced to LMIPs. See 
also [LSC94]. 

Safonov and Wyetzner use a convex parametrization of multipliers found by Zames and 
Falb [ZF68, ZF67] to prove stability of systems subject to "monotonic" or "odd-monotonic" 
nonlinearities (see [SW87] for details; see also [GG94]). Hall and How, along with Bern- 
stein and Haddad, generalize the use of quadratic Lyapunov functions along with multipliers 
for various classes of nonlinearities, and apply them to Aerospace problems; see for ex- 
ample [HB91A, HOW93, HH93B, HHHB92, HH93A, HHH93A, HCB93, HB93B, HB93A, 
HHH93B]. See also [CHD93]. 
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Multiple nonlinearities for Lur'e systems 

Our analysis method for Lur'e systems with multiple nonlinearities in §8.1 can be conser- 
vative, since the S-procedure can be conservative in this case. Rapoport [Rap86, Rap87, 
Rap88] devises a way of determining if there exists Lyapunov functions of the form (8.3) 
nonconservatively by generating appropriate LMIs. We note, however, that the number of 
these LMIs grows exponentially with the number of nonlinearities. Kamenetskii [Kam89] 
and Rapoport [Rap89] derive corresponding frequency-domain stability criteria. Kamenet- 
skii calls his results "convolution method for solving matrix inequalities" . 

Loop transformations 

Consider the Lur'e system with general sector conditions, i.e., 

x = Ax + Bp, q- Cx + Dp, 

Pi(t) = 4>i(<li(t)), a l( 7 2 < a<t> t (a) < fca 2 . 

Here we have dropped the variables w and z and the subscripts on B, C and D to simplify 
the presentation. We assume this system is well-posed, i.e., det(J — DA) 7^ for all diagonal 
A with m < An < pi. 

Define 

Pi - -5 (<f>i(<li) ~ ";90 = 4>i{li)- 

Pi - OLi 

It is readily shown that < a<f>i(a) < a 2 for all a. Let A and T denote the diagonal matrices 

A = diag(ai,...,a„J, F = diag(/3i - ai, . . . ,/?„, -«„,), 

so that p — r _1 (p — Aq). We now substitute p — Tp + Ag into (8.26) and, using our well- 
posedness assumption, solve for x and q in terms of x and p. This results in 

x = (A + BA(I- DA)~ 1 C)x + B(I-AD)~ 1 Tp, 

q = (I - Dk^Cx + ^I -Dk^DYp. 

We can therefore express (8.26) as 



x — Ax + Bp, q — Cx + Dp, 

pi(t) = $i(qi(t)), 0<a$i{a) < a 2 . 



(8.27) 



where 

A = A + BA{I- DA)~ 1 C, B = B(I- AD) _1 r, 
C ={I - DA)~ 1 C, D = {I - DA)- X DT. 

Note that (8.27) is in the standard Lur'e system form. So to analyze the more general Lur'e 
system (8.26), we simply apply the methods of §8.1 to the loop-transformed system (8.27). 
In practice, i.e., in a numerical implementation, it is probably better to derive the LMIs 
associated with the more general Lur'e system than to loop transform to the standard Lur'e 
system. 

The construction above is a loop transformation that maps nonlinearities in sector [ai,/3i] 
into the standard sector, i.e., [0, 1]. Similar transformations can be used to map any set of 
sectors into any other, including so-called infinite sectors, in which, say j3i — +00. 

The term loop transformation comes from a simple block-diagram interpretation of the equa- 
tions given above. For detailed descriptions of loop transformations, see the book by Desoer 
and Vidyasagar [DV75, p50]. 
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Chapter 9 

Systems with Multiplicative Noise 

9.1 Analysis of Systems with Multiplicative Noise 

9.1.1 Mean-square stability 

We first consider the discrete-time stochastic system 

(9.1) 



s(fc + l)= (A + J2A iPi (k))x{k), 



where p(0), p(l), . . ., are independent, identically distributed random variables with 

Ep(fc)=0, Ep(fc)p(fc) T = E = diag(a 1 ,..., f j L ). (9.2) 

We assume that x(0) is independent of the process p. 
Define M(k), the state correlation matrix, as 

M{k) = F,x{k)x{k) T . 

Of course, M satisfies the linear recursion 

L 

M{k+1) = AM{k)A T + ^ j a 2 l A i M{k)Aj, M{0) = Ex{0)x{0) T . (9.3) 

4=1 

If this linear recursion is stable, i.e., regardless of x(0), linifc^oo M(k) = 0, we say 
the system is mean-square stable. Mean-square stability implies, for example, that 
x(k) — > almost surely. 

It can be shown (see the Notes and References) that mean-square stability is 
equivalent to the existence of a matrix P > satisfying the LMI 

L 

A T PA -P + J2 a i A I PA i < °- ( 9 - 4 ) 

8 = 1 

An alternative, equivalent condition is that the dual inequality 
L 
AQA T -Q + Y, <7i A iQ A I < ° ( 9 - 5 ) 

i=l 

holds for some Q > 0. 
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Remark: For P > satisfying (9.4), we can interpret the function V(£) = £ T -P£ 
as a stochastic Lyapunov function: EV(i) decreases along trajectories of (9.1). 
Alternatively, the function V(M) — TV MP is a (linear) Lyapunov function for 
the deterministic system (9.3) (see the Notes and References). 



Remark: Mean-square stability can be verified directly by solving the Lyapunov 
equation in P 



A T PA -P + Y^ a i A ^ PA i +1 = 



and checking whether P > 0. Thus the LMIPs (9.4) and (9.5) offer no computa- 
tional (or theoretical) advantages for checking mean-square stability. 

We can consider variations on the mean-square stability problem, for example, 
determining the mean-square stability margin. Here, we are given A , . . . , A^, and 
asked to find the largest 7 such that with S < 7 2 /, the system (9.1) is mean-square 
stable. From the LMIP characterizations of mean-square stability, we can derive 
GEVP characterizations that yield the exact mean-square stability margin. Again, 
the GEVP offers no computational or theoretical advantages, since the mean-square 
stability margin can be obtained by computing the eigenvalue of a (large) matrix. 

9.1.2 State mean and covariance bounds with unit-energy inputs 

We now add an exogenous input w to our stochastic system: 

L 

x{k + 1) = Ax(k) + B w w(k) + ^2(Aix(k) + B w4 w{k))pi(k), x(0) = 0. (9.6) 

8 = 1 

We assume that the exogenous input w is deterministic, with energy not exceeding 
one, i.e., 



^2w{k) T w{k) < 1. 



fe=0 

Let x(k) denote the mean of x(k), which satisfies x(k + 1) = Ax(k) + B w w(k), and let 
X(k) denote the covariance of x(k), i.e., X(k) = E (x(k) — x(k)) (x(k) — x(k)) . We 
will develop joint bounds on x(k) and X(k). 

Suppose V(£) = (, T P£ where P > satisfies 

F,V{x{k+l)) < F,V{x{k)) + w{k) T w{k) (9.7) 

for all trajectories. Then, 

k 

EV(x(k))<Y,MJ) T HJ), k>0, 

which implies that, for all k > 0, 

F,x{k) T Px{k) = x{k) T Px{k) + F, {x{k) - x{k)) T P {x(k) - x{k)) 
= x{k) T Px{k) + Tr X{k)P 
< 1. 
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Let us now examine condition (9.7). We note that 

-I T 



EV(x(k+l)) 



x{k) 

w(k) 



M 



x(k) 

w(k) 



TV A T PA + Y] ViAjPAi X(k), 



where 



M 



A T 



PA B„ 



£< 



B w,i 



P A,, B„ 



Next, the LMI conditions 

A T 
B„„ 



PA B„ 



E*? 



B, 



P A> B v 



P 
/ 



and 



A T PA + J2 afAjPAi < P 



imply 



~EV(x(k + l)) < x T (k)Px(k) + w(k) T w{k) 



Tr ( A T PA + Y, ^AjPAA X(k) 



(9.8) 



(9.9) 



< x T (k)Px{k) + w{k) T w(k) 

+ E (x(k) - x{k)) T P (x(k) - x{k)) 

= F,V{x{k)) + w{k) T w{k), k > 0. 

Of course, LMI (9.8) implies LMI (9.9); thus, we conclude that LMI (9.8) im- 
plies (9.7). Therefore, we conclude that x(k) and X(k) must belong to the set 

{(x(k),X(k)) | x{k) T Px{k) + TV X{k)P < 1} 

for any P > satisfying the LMI (9.8). For example, we have the bound TV M(k)P < 1 
on the state covariance matrix. 

We can derive further bounds on M(k). For example, since we have TVM(fc)_P < 
A min (P) TV M{k) and therefore TV M{k) < l/A min (P), we can derive bounds on TV M 
by maximizing A m j n (P) subject to (9.8) and P > 0. This is an EVP. As another 
example, we can derive an upper bound on A max (M) by maximizing TV P subject 
to (9.8) and P > 0, which is another EVP. 



9.1.3 Bound on L 2 gain 

We now consider the system 

L 

x(k + 1) = Ax(k) + B w w(k) + ^ ( A i x ( k ) + B Wti w{k))pi{k), x(0) = 0, 

L 

z{k) = C z x(k) + D zw w{k) + Y (C x ,ix(k) + D ZWti w(k))pi(k). 

i=l 
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with the same assumptions on p. We assume that w is deterministic. 
We define the L2 gain r\ of this system as 



,2 A 



>r 



sup I E^ z(k) T z(k) 



fe=0 



^Tw{k) T w{k) < 1 



fe=0 



Suppose that V(£) = (, T P(,, with P > 0, satisfies 

EF(i(Hl)) -EV(i(fe)) < 7 2 w(fc) T u;(fc)-Ez(fc) T z(fc). 
Then 7 >r). The condition (9.12) can be shown to be equivalent to the LMI 

P 

7 2 / 



A B 

(-^z P>zw 





T 


' P " 


" 


A B w 






) 




/ 


^ z i-J zw 




A t 


-E>w,i 


T 


' P " 




A 




*~1 

^ ' Z^l 


U zw,i 




/ 




_ c z 



D zw.i 



< 0. 



(9.11) 



(9.12) 



(9.13) 



Minimizing 7 subject to LMI (9.13), which is an EVP, yields an upper bound on 77. 

Remark: It is straightforward to apply the scaling method developed in §6.3.4 
to obtain componentwise results. 



9.2 State-Feedback Synthesis 

We now add a control input u to our system: 

x(k+l) = Ax(k) + B u u(k) + B w w(k)+ 

L 

y^ {Aix(k) + B u ju(k) + B w>i w(k))pi(k), 

z{k) = C z x{k) + D zu u{k) + D zw w{k)+ 

L 

^ (C z ,ix(k) + D ZUji u(k) + D zw jw(k))pi(k). 

i=\ 

We seek a state-feedback u(k) = Kx(k) such that the closed-loop system 
x(k+l) = (A + B u K)x{k) + B w w(k)+ 

L 

Y, ((At + B Uii K)x(k) + B Wii w(k))pi(k), 

z{k) = {C z + D zu K)x{k) + D zw w{k)+ 

L 

Y, (( c z,i + D ZUti K)x(k) + D zw4 w(k))pi(k), 
satisfies various properties. 

9.2.1 Stabilizability 

We seek K and Q > such that 

{A+B U K)Q{A + B U K) T -Q 

L 

+ J2 °i( A i + B u ^K)Q{Ai + B U ^K) T < 0. 



(9.14) 



(9.15) 
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With the change of variable Y = KQ, we obtain the equivalent condition 
(AQ + BJOQ-^AQ + B U Y) T - Q 

L 

+ Y, <Ti(AiQ + B Uii Y)Q-\AiQ + B U ^Y) T < 0, 



(9.16) 



which is easily written as an LMI in Q and Y. 

Thus, system (9.14) is mean-square stabilizable (with constant state-feedback) if 
and only if the LMI Q > 0, (9.16) is feasible. In this case, a stabilizing state-feedback 
gain is K = YQ _1 . 

As an extension, we can maximize the closed-loop mean-square stability margin. 
System (9.15) is mean-square stable for E < j 2 I if and only if 



(AQ + B u Y)Q- l (AQ + B U Y) T - Q 



J2(AiQ + B Ui jr)Q~HAiQ + B u ,iY) T < 



(9.17) 



holds for some Y and Q > 0. Therefore, finding K that maximizes 7 reduces to the 
following GEVP: 

maximize 7 

subject to Q > 0, (9.17) 



9.2.2 Minimizing the bound on L2 gain 

We now seek a state- feedback gain K which minimizes the bound on L2 gain (as 
defined by (9.11)) for system (9.15). rj < 7 for some K if there exist P > and K 
such that 



E*? 



A t + B U ,K 



B„ 



1 T 



a 



D ZU ,K D 2 



P 



^2,t ~r Uzu.i*^- Uzw.i 



where 



A + B U K B u 
C Z + D ZU K D„ 



P 
/ 



p 



A + B U K B, u 
C Z + D ZU K D z , 



P 

7 2 / 



We make the change of variables Q = P , Y = KQ. Applying a congruence 
with Q = diag(Q, J), we get 



£-■ 



A t Q + B Uti Y B 
C z ,iQ + D ZUii Y D x 

AQ + B U Y B 
C Z Q + D ZU Y D, 



-1 T 



Q- 



AiQ + B Uti Y B u 

(^z.iQ + P>zu,il JJ Zl 



1 T 



Q- 1 



AQ + B U Y B v 
C Z Q + D ZU Y D z . 



< 



Q 

7 2 / 



This inequality is readily transformed to an LMI. 
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Notes and References 

Systems with multiplicative noise 

Systems with multiplicative noise are a special case of stochastic systems. For an introduction 
to this topic see Arnold [Arn74] or Astrom [Ast70]. 

A rigorous treatment of continuous-time stochastic systems requires much technical machin- 
ery (see [EG94]). In fact there are many ways to formulate the continuous-time version of a 
system such as (9.10); see Ito [Ito51] and Stratonovitch [Str66]. 



Stability of system with multiplicative noise 

For a survey of different concepts of stability for stochastic systems see [Koz69] and the 
references therein. The definition of mean-square stability can be found in [Sam59]. Mean- 
square stability is a strong form of stability. In particular, it implies stability of the mean 
Ex(k) and that all trajectories converge to zero with probability one [Kus67, Wil73]. For the 
systems considered here, it is also equivalent to Li2-stability, which is the property that (see 
for example, [EP92]) E^\ ||x(t)|| 2 has a (finite) limit as T — » oo for all initial conditions. 

The equation for the state correlation matrix (9.3) can be found in [McL71, Wil73]. An alge- 
braic criterion for mean-square stability of a system with multiplicative white noise is given 
in [NS72]. It is reminiscent of the classical (deterministic) Hurwitz criterion. Frequency- 
domain criteria are given in [WB71]. The related Lyapunov equation was given by Klein- 
mann [Kle69] for the case L — 1 (i.e., a single uncertainty). The criterion then reduces 
to an H2 norm condition on a certain transfer matrix. (This is in parallel with the results 
of [EP92], see below.) 

Kats and Krasovskii introduced in [KK60] the idea of a stochastic Lyapunov function and 
proved a related Lyapunov theorem. A thorough stability analysis of (nonlinear) stochastic 
systems was made by Kushner in [Kus67] using this idea of a stochastic Lyapunov func- 
tion. Kushner used this idea to derive bounds for return time, expected output energy, etc. 
However, this approach does not provide a way to construct the Lyapunov functions. 



Proof of stability criterion 

We now prove that condition (9.4) is a necessary and sufficient condition for mean-square 
stability. We first prove that condition (9.4) is sufficient. Let P > satisfy (9.4). Introduce 
the (linear Lyapunov) function V(M) = Tr MP, which is positive on the cone of nonnegative 
matrices. For nonzero M(k) satisfying (9.3), 



V(M(k + l)) 



Tr ( AM(k)A T + ^ o-fAiM(k)Al J P 

= Tr M(fc) ( A T PA + ^ ViAjPAi J 

< Tr M(k)P = V(M(k)). 

A further standard argument from Lyapunov theory proves that M(k) converges to zero as 

k — > 00. 

To prove that condition (9.4) is also necessary, we assume that the system is mean-square 
stable. This means that for every positive initial condition M(0) > 0, the solution to the 
linear recursion (9.3) goes to zero as k — » 0. Linearity of system (9.3) implies that, for an 
arbitrary choice of M(0), the corresponding solution M(k) converges to 0, i.e., (9.3) is stable. 
We now write (9.3) as m(k + 1) — Am(k), where m(k) is a vector containing the n elements 
of M(k), and A. is a real matrix. (This matrix can be expressed via Kronecker products 
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involving Aq, . . . , At-) Stability of A is equivalent to that of A T , and the corresponding 
matrix difference equation, 

L 

N(k + 1) = A T N(k)A + Y^ ViAjN(k)Ai, (9.18) 

»=i 

is also stable. 

Now let N(0) > 0, and let N(k) be the corresponding solution of (9.18). Since N(k) satisfies 
a stable first-order linear difference equation with constant coefficients, the function 

k 

P(k) = ]T N(j) 

3=0 

has a (finite) limit for k — > oo, which we denote by P. The fact that N(0) > implies P > 0. 
Now (9.18) implies 

L 

P(k + 1) = N(0) + A T P(k)A + Y <J*AlP(k)Ai 

Taking the limit k — » oo shows that P satisfies (9.4). 

Mean-square stability margin 

System (9.10) can be viewed as a linear system subject to random parameter variations. For 
systems with deterministic parameters only known to lie in ranges, the computation of an 
exact stability margin is an NP-hard problem [CD92]. The fact that a stochastic analog 
of the deterministic stability margin is easier to compute should not be surprising; we are 
dealing with a very special form of stability, which only considers the average behavior of the 
system. 

An exact stability margin can also be computed for continuous-time systems with multiplica- 
tive noise provided the Ito framework is considered; see [EG94]. In contrast, the computation 
of an exact stability margin seems difficult for Stratonovitch systems. 

In [EP92], El Bouhtouri and Pritchard provide a complete robustness analysis in a slightly 
different framework than ours, namely for so-called "block-diagonal" perturbations. Roughly 
speaking, they consider a continuous-time Ito system of the form 

L 

dx(t) -= Ax(t)dt + y^ BiAi(Cx(t))dpi{t), 

j=i 

where the operators Aj are Lipschitz continuous, with Aj(0) = 0; the processes Pi, i — 
1,...,L, are standard Brownian motions (which, roughly speaking, are integrals of white 
noise). 

We note that our framework can be recovered by assuming that the operators A; are restricted 
to a diagonal structure of the form A;(.z) = <nz. 

Measuring the "size" of the perturbation term by (y^._ 1 ||Ai|| 2 J , where || • || denotes the 
Lipschitz norm, leads the authors of [EP92] to the robustness margin 

minimize 7 

subject to P > 0, A T P + PA + C T C < 0, Bf PB t < 7, i = l,...,L 
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For L — 1, and B\ and C are vectors, that is, when a single scalar uncertain parameter 
perturbs an LTI system, both this result and ours coincide with the H 2 norm of (^4, Bi,C). 
This is consistent with the earlier results of Kleinmann [Kle69] and Willems [WB71] (see 
also [BB91, P114] for a related result). For L > 1 however, applying their results to our 
framework would be conservative, since we only consider "diagonal" perturbations with "re- 
peated elements", A; = ail. Note that the LMI approach can also solve the state-feedback 
synthesis problem in the framework of [EP92]. El Bouhtouri and Pritchard provide a solution 
of this problem in [EP94]. 

Stability conditions for arbitrary noise intensities are given in geometric terms in [WW83]. 
In our framework, this happens if and only if the optimal value of the corresponding GEVP 
is zero. 



Bounds on state covariance with noise input 

We consider system (9.6) in which to is a unit white noise process, i.e., w(k) are independent, 
identically distributed with Ew(k) — 0, Ew(k)w(k) T = W, and w is independent of p. 
In this case, of course, the state mean x is zero. We derive an upper bound on the state 
correlation (or covariance) of the system. 

The state correlation M(k) satisfies the difference equation 

L 

M(k + 1) = AM(k)A T + B w WBl + ^ <% (AiM(k)Af + B Wti WBl 



i i 



with M(0) = 0. 

Since the system is mean-square stable, the state correlation M(k) has a (finite) limit M 
which satisfies 



AU 



AM ao A T + B w WBl + Y^ ^ (A.M^Aj + B Wli WBl 



In fact, the matrix M^, can be computed directly as the solution of a linear matrix equation. 
However, the above LMI formulation extends immediately to more complicated situations (for 
instance, when the white noise input w has a covariance matrix with unknown off-diagonal 
elements), while the "direct" method does not. See the Notes and References for Chapter 6 
for details on how these more complicated situations can be handled. 

Extension to other stochastic systems 

The Lur'e stability problem considered in §8.1 can be extended to the stochastic framework; 
see Wonham [Won66]. This extension can be cast in terms of LMIs. 

It is also possible to extend the results in this chapter to "uncertain stochastic systems" , of 
the form 

x(k + 1) = ( A{k) + Y ViA i p l {k) ] x(k), 

where the time-varying matrix A{k) is only known to belong to a given polytope. 

State-feedback synthesis 

Conditions for stabilizability for arbitrary noise intensities are given in geometric terms 
in [WW83]. Apart from this work, the bulk of earlier research on this topic concentrated 

Copyright © 1994 by the Society for Industrial and Applied Mathematics. 



Notes and References 139 

on Linear-Quadratic Regulator theory for continuous-time Ito systems, which addresses the 
problem of finding an input u that minimize a performance index of the form 

J(K) = E (x(t) T Qx{t) + u(t) T Ru(t)) dt 

Jo 

where Q > and R > are given. The solution of this problem, as found in [McL71] 
or [Ben92], can be expressed as a (linear, constant) state-feedback u(t) — Kx(t) where the 
gain K is given in terms of the solution of a non-standard Riccati equation. (See [Kle69, 
Won68, FR75, BH88b] for additional details.) The existing methods (see e.g. [Phi89, 
RSH90]) use homotopy algorithms to solve these equations, with no guarantee of global con- 
vergence. We can solve these non-standard Riccati equations reliably (i.e., with guaranteed 
convergence), by recognizing that the solution is an extremal point of a certain LMI feasibility 
set. 
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Chapter 10 

Miscellaneous Problems 

10.1 Optimization over an Affine Family of Linear Systems 

We consider a family of linear systems, 

x = Ax + B w w, z = C z {9)x + D zw (9)w, (10-1) 

where C z and D zw depend affinely on a parameter 9 G R p . We assume that A is 
stable and (A, B w ) is controllable. The transfer function, 

H e {s) = C z (9)(sl - A)- 1 ^ + D zw (9), 

depends affinely on 9. 

Several problems arising in system and control theory have the form 

minimize (pn(Ha) 

* v ; (10.2) 

subject to ifii(Hg) < o>i, i = 1, . . . ,p 

where ipi are various convex functionals. These problems can often be recast as LMI 
problems. To do this, we will represent tpi(Hg) < a» as an LMI in 9, on, and possibly 
some auxiliary variables Q (for each i): 

Fi{9,ai,Ci)>0. 
The general optimization problem (10.2) can then be expressed as the EVP 

minimize ao 

subject to Fi(0,ai,Q) > 0, i = 0,l,...,p 

10.1.1 H2 norm 

The H2 norm of the system (10.1), i.e., 



1 r°° 

\H e \\l = — TV / H e (j U yH (jw) du;, 
27T Jo 



is finite if and only if D zw (9) = 0. In this case, it equals TV C z W c Cj , where W c > is 
the controllability Gramian of the system (10.1) which satisfies (6.6). Therefore, the 
H 2 norm of the system (10.1) is less than or equal to 7 if and only if the following 
conditions on 9 and 7 2 are satisfied: 

D zw (9)=0, TV C z (9)W c C z (9f < 7 2 . 

The quadratic constraint on 9 is readily cast as an LMI. 
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10.1.2 Hqo norm 

From the bounded-real lemma, we have ||-He||oo < 7 if and only if there exists P > 
such that 



A 1 P+PA PB n 



B T P 



-1 2 I 



C Z (0) T 
D ZW (0) T 



C z (0) D zw {0) ] < 0. 



Remark: Here we have converted the so-called "semi-infinite" convex constraint 
||i7e(io;)|| < 7 for all u € R into a finite-dimensional convex (linear matrix) 
inequality. 



10.1.3 Entropy 

The ^-entropy of the system (10.1) is defined as 



UH, 



A 



lK a 6) 



-7 

2tt 
00, 



logdet(J - "f 2 Hg{iLo)H e {iLu)*) du, if \\H e \\oo < 7, 

otherwise. 



When it is finite, I 1 {Hg) is given by TV B^PB W , where P is a symmetric matrix 
with the smallest possible maximum singular value among all solutions of the Riccati 
equation 

A T P +PA+ C z (0) T C z (0) + \pb w B t w P = 0. 

7 

For the system (10.1), the 7-entropy constraint I 1 < A is therefore equivalent to an 
LMIin 0, P= P T , 7 2 , and A: 



D zw {0) = 0, 



A T P+PA PB W C Z (9) T 
BlP -7 2 / 

C z {9) -/ 



<0, 



TyBZPB w <\. 



10.1.4 Dissipativity 

Suppose that w and z are the same size. The dissipativity of H§ (see (6.59)) exceeds 
7 if and only if the LMI in the variables P = P T and holds: 



< 0. 



A T P+PA PB w -C z {0) T 

B T w P-C z {0) 2 1 I-D zw {e)-D zw {e) T 

We remind the reader that passivity corresponds to zero dissipativity. 

10.1.5 Hankel norm 

The Hankel norm of Hg is less than 7 if and only if the following LMI in Q, and 7 2 
holds: 

D zw (9)=0, A T Q + QA + C z (0fC z (0)<O, 
7 2j _ Wc /2 QWc /2 > 0, Q > 0. 

(W c is the controllability Gramian defined by (6.6).) 
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10.2 Analysis of Systems with LTI Perturbations 

In this section, we address the problem of determining the stability of the system 

B p p, q = C q x + D qp p, Pl = S i *q i , i = l,...,n p , (10.3) 



x = Ax - 
where p = [pi 



Pn 



Q 



[91 



<b, 



and 5i, i 



1, 



°p> 



are impulse 



responses of single-output single-output LTI systems whose H^ norm is strictly less 
than one, and * denotes convolution. This problem has many variations: Some of the 
6i may be impulse response matrices, or they may satisfy equality constraints such as 
S\ = 62- Our analysis can be readily extended to these cases. 

Let H q p(s) = C q {sl — A)~ 1 B p + D qp . Suppose that there exists a diagonal transfer 
matrix W(s) = diag(JFi(s), . . . , W n (s)), where W\(s), ..., W n (s) are single-input 
single-output transfer functions with no poles or zeros on the imaginary axis such 
that 



sup \\W(iu))H qp (iu))W(iu>) 



< 1 



(10.4) 



Then from a small-gain argument, the system (10.3) is stable. Such a IF is referred 
to as a frequency- dependent scaling. 

Condition (10.4) is equivalent to the existence of 7 > such that 

H qp (iu))*W(iuj)*W(iuj)H qp (iuj) - W(iu)*W{iu) + 7/ < 0, 

for all u) £ R. From spectral factorization theory, it can be shown that such a W 
exists if and only if there exists a stable, diagonal transfer matrix V and positive p 
such that 



H qp (iuj)*(V(iu>) + V{iuj)*)H qp {iuj) - {V{iuj) + V{iuj)*) + 7/ < 0, 
V(iu>) + V{iuj)* > fil, 



(10.5) 



for all uj £ R. 

In order to reduce (10.5) to an LMI condition, we restrict V to belong to an affine, 
finite-dimensional set of diagonal transfer matrices, i.e., of the form 

Vg(s) = Dv(0) + C v (9)(al - Ay l B, 

where Dy and Cy are affine functions are 6. 

Then, H qp (—s) T Ve(s)H qp (s) — V$(s) has a state-space realization (A aug , B &ug , 
Caug(^)j -Daug(^)) where C aug and D aug are affine in 6, and (^4 a ugj ^aug) is controllable. 

From §10.1.4, the condition that V$(iu>) + V$(iu>)* > (il for all u) £ R is equivalent 
to the existence of Pi > 0, 9 and 7 > such that the LMI 



A? Pi + PiAy 
(PlBy - Cy(0) T f 



PlBy - Cy{6) T 

-(D v (6) + D v (d) T ) + pI 



< 



holds. 

The condition H qp (iu>)*(Vg(iu>) + V g (icj)*)H qp (icj) - {Vg(iu) + V (iuj)*) + 7/ < 
is equivalent to the existence of P2 = P% and 6 satisfying the LMI 



(P2B 



^aug-P2 



iaug 
\T\T 



-"aug T >^aug(") 



c a 



ill'S 



+ Ganger V D aug (6) + D ang (ey + 7 / 



< 0. 



(10.6) 



(See the Notes and References for details.) Thus proving stability of the system (10.3) 
using a finite-dimensional set of frequency-dependent scalings is an LMIP. 
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10.3 Positive Orthant Stabilizability 

The LTI system x = Ax is said to be positive orthant stable if cc(0) £ R™ implies that 
x{t) £ R" for all t > 0, and x(t) — > as t — > oo. It can be shown that the system 
x = Ax is positive orthant stable if and only if Aij > for i ^ j, and there exists 
a diagonal P > such that PA T + AP < 0. Therefore checking positive orthant 
stability is an LMIP. 

We next consider the problem of finding K such that the system x = (A + BK)x 
is positive orthant stable. Equivalently, we seek a diagonal Q > and K such that 
Q(^ + BK) T + {A + BK)Q < 0, with the off-diagonal entries of A + BK being 
nonnegative. Since Q > is diagonal, the last condition holds if and only if all 
the off-diagonal entries of (A + BK)Q are nonnegative. Therefore, with the change of 
variables Y = KQ, proving positive orthant stabilizability for the system x = Ax + Bu 
is equivalent to finding a solution to the LMIP with variables Q and Y: 



> 0, (AQ + BY)ij > 0, i ± j, QA 1 



Y T B T 



AQ + BY<0. (10.7) 



Remark: The method can be extended to invariance of more general (polyhedral) 
sets. Positive orthant stability and stabilizability of LDIs can be handled in a 
similar way. See the Notes and References. 



10.4 Linear Systems with Delays 

Consider the system described by the delay-differential equation 
d 



dt 



r;{t) = Ax{t) + Y J A t x{t-T l ), 



(10.8) 



where x(t) £ R™, and Tj > 0. If the functional 



V(x, t) = x{t) T Px{t) + ^2 x (t~ S ) T - P * a: ( f ~ s ) ds > 



(10.9) 



where P > 0, P\ > 0, . . . , Pl > 0, satisfies dV(x, t) /dt < for every x satisfying (10. 
then the system (10.8) is stable, i.e., x(t) — * as t — > oo. 
It can be verified that dV(x,t)/dt = y(t) T Wy(t), where 



W 



A T P + PA + Y^ Pi PM 
A\P ' ' -P 1 



A\P 







PA L 



-Pl 



y(t) 



x(t) 
x(t- ri) 

x(t- t l ) 



Therefore, we can prove stability of system (10.8) using Lyapunov functionals of the 
form (10.9) by solving the LMIP W < 0, P > 0, P x > 0, . . . , P L > 0. 
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Remark: The techniques for proving stability of norm-bound LDIs, discussed 
in Chapter 5, can also be used on the system (10.8); it can be verified that the 
LMI (5.15) that we get for quadratic stability of a DNLDI, in the case when the 
state a; is a scalar, is the same as the LMI W < above. We also note that we 
can regard the delay elements of system (10.8) as convolution operators with unit 
T-i2 gain, and use the techniques of §10.2 to check its stability. 



Next, we add an input u to the system (10.8) and consider 



-rx(t) = Ax(t) + Bu{t) + Y^ A lX (t - n). 



<ll 



(10.10) 



j=l 



We seek a state-feedback u(t) = Kx(t) such that the system (10.10) is stable. 

From the discussion above, there exists a state- feedback gain K such that a Lya- 
punov functional of the form (10.9) proves the stability of the system (10.10), if 



{A + BKfP + P{A + BK) + Y p i PA i ■■■ pa l 
A\P -P x ■■■ 



W 



A\P 







-Pi. 



<0 



for some P, P\, . . . ,Pl > 0. 

Multiplying every block entry of W on the left and on the right by P _1 and setting 
Q = P~ 1 ,Q i = p- 1 P i P~ 1 and Y = KP" 1 , we obtain the condition 



X 



AQ + QA T + BY + Y T B T + Y,Q t MQ 
QAJ -Qx 



A L Q 





QA 







<0. 



Thus, checking stabilizability of the system (10.8) using Lyapunov functionals of the 
form (10.9) is an LMIP in the variables Q, Y , Qi, . . . , Ql- 

Remark: It is possible to use the elimination procedure of §2.6.2 to eliminate 
the matrix variable Y and obtain an equivalent LMIP with fewer variables. 



10.5 Interpolation Problems 

10.5.1 Tangential Nevanlinna-Pick problem 

Given Ai,...,A TO with A^ G C + = {s\ Res > 0} and distinct, u\, . . . ,u m , with 
Ui € C q and Vi, ■ ■ ■ , v m , with m G C p , i = 1, . . . , m, the tangential Nevanlinna-Pick 
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problem is to find, if possible, a function H : C — > C pxq which is analytic in C+, 
and satisfies 

H{\ i )u i = v r , i = \,...,m, with Hfflloo < 1. (10.11) 

Problem (10.11) arises in multi-input multi-output H^ control theory. 

From Nevanlinna-Pick theory, problem (10.11) has a solution and only if the Pick 
matrix TV defined by 

N = <u 3 -v*v 3 
3 K + A,) 

is positive semidefinite. N can also be obtained as the solution of the Lyapunov 
equation 

A*N +NA- (U*U - V*V) = 0. 

where A = diag(A!, . . . , A m ), U = [u\ ■ ■ ■ u m ], V = [vi ■ ■ ■ v m ]. For future reference, 
we note that N = G[ n — G ut, where 

A*G in + G m A - U*U = 0, A*G out + G out A - V*V = 0. 

Solving the tangential Nevanlinna-Pick problem simply requires checking whether N > 
0. 

10.5.2 Nevanlinna-Pick interpolation with scaling 

We now consider a simple variation of problem (10.11): Given Ai, . . . , A m with Aj € 
C+, Mi, • • • , u m , with Ui e C p and vi, . . . , v m , with Vi G C p , i = 1, . . . , m, the problem 
is to find 



7o P t 



inf < WDHD^l 



H is analytic in C+, D = D* > i 

y + ' > , (10.12) 

DeV, H(Xi)ui = Vi,i = l,...,m I 



where T> is the set of m x m block-diagonal matrices with some specified block struc- 
ture. Problem (10.12) corresponds to finding the smallest scaled H^ norm of all 
interpolants. This problem arises in multi-input multi-output Hqo control synthesis 
for systems with structured perturbations. 

With a change of variables P = D* D and the discussion in the previous section, 
it follows that 7 op t is the smallest positive 7 such that there exists P > 0, P G T> such 
that the following equations and inequality hold: 

A*G in + G in A- U*PU=0, 
A*G out + G out A-V*PV = 0, 
7 2 G in - G out > 0. 

This is a GEVP. 

10.5.3 Frequency response identification 

We consider the problem of identifying the transfer function H of a linear system 
from noisy measurements of its frequency response at a set of frequencies. We seek H 
satisfying two constraints: 
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• Consistency with measurements. For some n, with |n,| < e, we have f\ = 
H(JLUi) + rii. Here, /, is the measurement of the frequency response at frequency 
Ui, m is the (unknown) measurement error, and e is the measurement precision. 

• Prior assumption, a-shifted H^ norm of H does not exceed M. 

From Nevanlinna-Pick theory, there exist H and n with H(u>i) = fi+rii satisfying 
these conditions if and only if there exist Tij with |rij| < e, G ul > and G out > such 
that 

M 2 G in - G out > 0, 

(A + al)* G in + G in {A + al) - e*e = 0, 

(A + al)* G out + G out (A + al) - (/ + n)*(f + n) = 0, 

where A = diag(jwi, . . . , ju> m ), e = [1 ••• 1], / = [/i ••• f m ], and n = 
[ni • • • n m ]. It can be shown that these conditions are equivalent to 

M 2 G in - Gout > 0, 

{A + al)* dn + G^ {A + al) - e*e = 0, 

(A + al)* G out + G out (A + al) - (/ + »)*(/ + n) > 0, 

with \m\ < e. 

With this observation, we can answer a number of interesting questions in fre- 
quency response identification by solving EVPs and GEVPs. 

• For fixed a and e, minimize M . Solving this GEVP answers the question "Given 
a and a bound on the noise values, what is the smallest possible a- shifted 
Hqo norm of the system consistent with the measurements of the frequency 
response?" 

• For fixed a and M, minimize e. Solving this EVP answers the question "Given 
a and a bound on a- shifted Hqo norm of the system, what is the "smallest" 
possible noise such that the measurements are consistent with the given values 
of a and M. 

10.6 The Inverse Problem of Optimal Control 

Given a system 



x = Ax + Bu, x(0) = xq, 



Q 1 ' 2 
R 1 ' 2 



x 

u 



with (A, B) is stabilizable, (Q,A) is detectable and R > 0, the LQR problem is to 
determine u that minimizes 

z z dt. 

The solution of this problem can be expressed as a state-feedback u = Kx with 
K = —R~ 1 B T P, where P is the unique nonnegative solution of 

A T P +PA- PBR- 1 B T P + Q = 0. 

The inverse problem of optimal control is the following. Given a matrix K, determine 
if there exist Q > and R > 0, such that (Q, A) is detectable and u = Kx is the 
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optimal control for the corresponding LQR problem. Equivalently, we seek R > and 
Q > such that there exists P nonnegative and Pi positive-definite satisfying 

{A + BK) T P + P{A + BK) + K T RK + Q = 0, B T P + RK = 

and A T P 1 + P X A < Q. This is an LMIP in P, P x , R and Q. (The condition involving 
Pi is equivalent to (Q, A) being detectable.) 

10.7 System Realization Problems 

Consider the discrete-time minimal LTI system 

x(k+l) = Ax{k) + Bu{k), y{k) = Cx{k), (10.13) 

where x : Z + — > R", u : Z + — * R n " y : Z + — > R™". The system realization problem is 
to find a change of state coordinates x = Tx, such that the new realization 

x{k+ 1) = T- 1 ATx{k) + T- 1 Bu{k) y{k) = CTx{k), (10.14) 

satisfies two competing constraints: First, the input-to-state transfer matrix, T~ 1 (zl— 
A)^ 1 B should be "small", in order to avoid state overflow in numerical implementa- 
tion; and second, the state-to-output transfer matrix, C(zI—A)~~ 1 T should be "small", 
in order to minimize effects of state quantization at the output. 

Suppose we assume that the RMS value of the input is bounded by a and require 
the RMS value of the state to be less than one. This yields the bound 

WT^izI-Ay'BW^Kl/a, (10.15) 

where the Hoc norm of the transfer matrix H (z) is defined as 

||ff|U = sup{||ff(z)|| I \z\ >1}. 

Next, suppose that the state quantization noise is modeled as a white noise sequence 
w(k) with 'Ew(k) T w(k) = r/, injected directly into the state, and its effect on the 
output is measured by its RMS value, which is just r\ times the H 2 norm of the 
state-to-output transfer matrix: 

Pnoisc = v\\C(zI-A)- 1 T\\ 2 . (10.16) 

Our problem is then to compute T to minimize the noise power (10.16) subject to the 
overflow avoidance constraint (10.15). 

The constraint (10.15) is equivalent to the existence of P > such that 



<0. 



A T PA- P + T^T- 1 A T PB 

B T PA B T PB - 1 1 a 2 

The output noise power can be expressed as 

p 2 noise = ry 2 Tr T T W T, 

where W Q is the observability Gramian of the original system (A 7 B 7 C), given by 

Wo = J2(A T ) k C T CA k . 

fe=0 
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W is the solution of the Lyapunov equation 

A T W A -W + C T C = 0. 

With X = r p" Tr p- 1 ^ the realization problem is: Minimize r/ 2 TrW X^ x subject 
to X > and the LMI (in P > and X) 

A T PA - P + X A T PB 



B T PA B T PB-I/a 2 -"' 

This is a convex problem in X and P, and can be transformed into the EVP 
minimize r/ 2 TV YW C 



(10.17) 



subject to (10.17), P > 0, 



Y I 

I X 



> 



Similar methods can be used to handle several variations and extensions. 

Input u has RMS value bounded componentwise 

Suppose the RMS value of each component m is less than a (instead of the RMS value 
of u being less than a) and that the RMS value of the state is still required to be less 
than one. Then, using the methods of §10.9, an equivalent condition is the LMI (with 
variables X, P and R): 



<0, 



P>0, 



(10.18) 



A T PA -P + X A T PB 

B T PA B T PB - R/a 2 

where R > is diagonal, with unit trace, so that the EVP (over X , Y, P and R) is 

minimize r/ 2 TV YW C 

subject to (10.18), P > 0, X > 

R > 0, R is diagonal, TV R = 1 

Y I 



I X 



> 



Minimizing state-to-output Hqo norm 

Alternatively, we can measure the effect of the quantization noise on the output with 
the Hqo norm, that is, we can choose T to minimize \\C(zI — A)~ 1 T\\ co subject to the 
constraint (10.15). The constraint ||C(z/ — j4) _1 T|| oc < 7 is equivalent to the LMI 
(over Y > and X = T~ T T~ X > 0): 



A T YA -Y + C T C A T Y 

YA Y - X/a 2 



< 0. 



(10.19) 



Therefore choosing T to minimize \\C(zI — A) 1 T\\ CC subject to (10.15) is the GEVP 



minimize 7 
subject to P > 0, 



Y >0, 



X >0, 



(10.17), (10.19) 



An optimal state coordinate transformation T is any matrix that satisfies X opt 
(TT T )~ 1 , where X opt is an optimal value of X in the GEVP. 
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10.8 Multi-Criterion LQG 

Consider the LTI system given by 

x = Ax + Bu + w, y=Cx + v, z = 



)l/2 

R 1 ' 2 



x 

u 



where u is the control input; y is the measured output; z is the exogenous output; 
w and v are independent white noise signals with constant, positive-definite spectral 
density matrices W and V respectively; we further assume that Q > and R > 0, 
that (A,B) is controllable, and that (C,A) and (Q,A) are observable. 

The standard Linear-Quadratic Gaussian (LQG) problem is to minimize 

J, = lim F,z{t) T z(t) 

t — >oo 

over u, subject to the condition that u(t) is measurable on y(r) for r < t. The optimal 
cost is given by 

J? qg = Tr(X Hg U + QY lqs ), 

where X\ qg and Y\ qg are the unique positive-definite solutions of the Riccati equations 

A X\ qg + X\ qg A — X\ qg BR~ B X\ qg + Q = 0, 

AY lqs + Y lqg A T - Y\ qg C T V- l CY lqg + W = 0, 



(10.20) 



and U = yiqgC^V^CTiqg. 

In the multi-criterion LQG problem, we have several exogenous outputs of interest 
given by 

,1/2 

<3,>0, R t >0, i = 0,...,p. 



R, 



1/2 



X 

u 



We assume that (Qo,A) is observable. For each Zi, we associate a cost function 



J{ = lim Ezi(t) T Zi(t), i = 0, 



,p. 



(10.21) 



The multi-criterion LQG problem is to minimize J? over u subject to the measurabil- 
ity condition and the constraints J{ < ji, i = 1, . . . ,p. This is a convex optimization 
problem, whose solution is given by maximizing 

p 
Tr(X lqg U+QY lqg )~Y,^n, 

8=1 

over nonnegative T\, . . . , r p , where ^i qg and Y\ qg are the solutions of (10.20) with 



Q = Qo + yVjQi 



and 



i?=i?0 + X)Ti-Ri. 



Noting that ^"i qg > X for every X > that satisfies 

A T X + XA- XBRT 1 B T X + Q > 0, 
we conclude that the optimal cost is the maximum of 



/ p \ p 

TrlxU+iQo + ^TiQjYun] -^nn, 

V i=l / j=l 
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over X , n, . . . , t p , subject to X > 0, n > 0, . . . , r p > and 



Computing this is an EVP. 



i=l 



10.9 Nonconvex Multi- Criterion Quadratic Problems 

In this section, we consider the LTI system x = Ax + Bu, x(0) = xo, where (A, B) is 
controllable. For any u, we define a set of p + 1 cost indices Jq, . . . , J p by 



Ji(u) 



\x T u T ] 



Qi U"i 

CT Ri 



X 

u 



dt, i = 0, . . . ,p. 



Here the symmetric matrices 

Qi L'i 

C; Ri 



, i = 0,...,p, 



(10.22) 



are not necessarily positive-definite. The constrained optimal control problem is: 

minimize Jo , 

subject to Ji < 7j, i = 1, . . . ,p, x(t) — > as t — > oo 

The solution to this problem proceeds as follows: We first define 
p p v 

Q = Qo + Y^ T ^^ R=Ro + Y; T * R *> 0=00 + ^^, 

i—l i—1 i—1 

where n > 0, . . . , t p > 0. Next, with r = [ri • • • r p ], we define 

p p 

S{t, u) = J + ^ nJi - ^2 nji. 

8=1 1=1 

Then, the solution of problem (10.22), when it is not — oo, is given by: 
sup inf {S(t, u) I x(t) — » as t — > oo } 

(See the Notes and References.) 

For any fixed r, the infimum over u of S(t,u), when it is not — oo, is computed 
by solving the EVP in P = P T 

p 
maximize x(0) Px(0) — > j Tjjj 

(10.23) 



subject to 



A T P + PA + Q PB + C T 
B T P + C R 



> 



Therefore, the optimal control problem is solved by the EVP in P = P T and r: 

p 
maximize x(0) T Px(0) — > Tiji 



subject to 



A T P + PA + Q PB + C T 
B T P + C R 



> 0, n > 0, . . . , Tp > 0. 
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Notes and References 

Optimization over an affine family of linear systems 

Optimization over an affine family of transfer matrices arises in linear controller design via 
Youla's parametrization of closed- loop transfer matrices; see Boyd and Barratt [BB91] and 
the references therein. Other examples include finite-impulse-response (FIR) filter design (see 
for example [OKUI88]) and antenna array weight design [KRB89]. 

In [Kav94] Kavranoglu considers the problem of approximating in the Hoc norm a given 
transfer matrix by a constant matrix, which is a problem of the form considered in §10.1. He 
casts the problem as an EVP. 



Stability of systems with LTI perturbations 

For references about this problem, we refer the reader to the Notes and References in Chap- 
ter 3. LMI (10.6) is derived from Theorems 3 and 4 in Willems [Wil71b]. 



Positive orthant stabilizability 

The problem of positive orthant stability and holdability was extensively studied by Berman, 
Neumann and Stern [BNS89, §7.4]. They solve the positive orthant holdability problem 
when the control input is scalar; our results extend theirs to the multi-input case. The study 
of positive orthant holdability draws heavily from the theory of positive matrices, references 
for which are [BNS89] and the book by Berman and Plemmons [BP79, CH6]. 

Diagonal solutions to the Lyapunov equation play a central role in the positive orthant 
stabilizability and holdability problems. A related paper is by Geromel [Ger.85], who gives 
a computational procedure to find diagonal solutions of the Lyapunov equation. Diagonal 
quadratic Lyapunov functions also arise in the study of large-scale systems [MH78]; see 
also §3.4. A survey of results and applications of diagonal Lyapunov stability is given in 
Kaszkurewicz and Bhaya [KB93]. 

A generalization of the concept of positive orthant stability is that of invariance of polyhedral 
sets. The problem of checking whether a given polyhedron is invariant as well as the associated 
state-feedback synthesis problem have been considered by many authors, see e.g., [Bit88, 
BBT90, HB91b] and references therein. The LMI method of §10.3 can be extended to the 
polyhedral case, using the approach of [CH93]. Finally, we note that the results of §10.3 are 
easily generalized to the LDIs considered in Chapter 4. 



Stabilization of systems with time delays 

The introduction of Lyapunov functionals of the form (10.9) is due to Krasovskii [Kra56]. 
Skorodinskii [Sko90] observed that the problem of proving stability of a system with delays 
via Lyapunov-Krasovskii functionals is a convex problem. See also [PF92, WBL92, SL93B, 
FBB92]. 

It is interesting to note that the "structured stability problem" or "/i-analysis problem" 
can be cast in terms of linear systems with delays. Consider a stable LTI system with 
transfer matrix H and L inputs and outputs, connected in feedback with a diagonal transfer 
matrix A with Hoo norm less than one. This system is stable for all such A if and only if 
sup wg j> fi(H(ju))) < 1, where /i denotes the structured singular value (see [Doy82]). It can 
be shown (see [Boy86]) that if the feedback system is stable for all A which are diagonal, 
with Au(s) = e~" Ti , Tt > (i.e., delays) then the system is stable for all A with ||A||oo < 1. 
Thus, verifying sup ug j> /j,(H(ju))) < 1 can be cast as checking stability of a linear system 
with L arbitrary, positive delays. In particular we see that Krasovskii's method from 1956 
can be interpreted as a Lyapunov-based /i-analysis method. 
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Interpolation problems 

The classical Nevanlinna-Pick problem dates back at least to 1916 [PlCl6, Nev19]. There 
are two matrix versions of this problem: the matrix Nevanlinna-Pick problem [DGK79] 
and the tangential Nevanlinna-Pick problem [Fed75]. A comprehensive description of the 
Nevanlinna-Pick problem, its extensions and variations can be found in the book by Ball, 
Gohberg and Rodman [BGR90]. 

A number of researchers have studied the application of Nevanlinna-Pick theory to problems 
in system and control: Delsarte, Genin and Kamp discuss the role of the matrix Nevanlinna- 
Pick problem in circuit and systems theory [DGK81]. Zames and Francis, in [ZF83], study 
the implications of interpolation conditions in controller design; also see [OF85]. Chang 
and Pearson [CP84] solve a class of Hoo-optimal controller design problems using matrix 
Nevanlinna-Pick theory. Safonov [Saf86] reduces the design of controllers for systems with 
structured perturbations to the scaled matrix Nevanlinna-Pick problem. In [Kim87], Kimura 
reduces a class of Hoo-optimal controller design problems to the tangential Nevanlinna-Pick 
problem; see also [Bal94]. 

System identification 

The system identification problem considered in §10.5.3 can be found in [CNF92]. Other 
system identification problems can be cast in terms of LMI problems; see e.g., [PKT + 94]. 

The inverse problem of optimal control 

This problem was first considered by Kalman [Kal64], who solved it when the control u is 
scalar; see also [And66c]. It is discussed in great detail by Anderson and Moore [AM90, 
§5.6], Fujii and Narazaki [FN84] and the references therein; they solve the problem when 
the control weighting matrix R is known, by checking that the return difference inequality 

(l - B T (-iioI - A T ) _1 A") R (7 - K T (iuI - A)" 1 B) > R 

holds for all u) £ R.. Our result handles the case of unknown R, i.e., the most general form 
of the inverse optimal control problem. 

System realization problems 

For a discussion of the problem of system realizations see [MR76, Thi84], which give an- 
alytical solutions via balanced realizations for special realization problems. The book by 
Gevers and Li [GL93a] describes a number of digital filter realization problems. Liu, Skelton 
and Grigoriadis discuss the problem of optimal finite word-length controller implementa- 
tion [LSG92]. See also Rotea and Williamson [RW94B, RW94a]. 

We conjecture that the LMI approach to system realization can also incorporate the constraint 
of stability, i.e., that the (nonlinear) system not exhibit limit cycles due to quantization or 
overflow; see, e.g., [Wil74b]. 

We also note that an Hoo norm constraint on the input-to-state transfer matrix yields, 
indirectly, a bound on the peak gain from input to state; Boyd and Doyle [BD87]. This can 
be used to guarantee no overflow given a maximum peak input level. 

Multi-criterion LQG 

The multi-criterion LQG problem is discussed in [BB91]; see also the references cited there. 
Rotea [Rot93] calls this problem "generalized H2 control". He shows that one can restrict 
attention to observer-based controllers such as 

— x = {A + BK)x + YC T V~ 1 {y-Cx), u = Kx, 
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where Y is given by (10.20). The covariance of the residual state x — x is Y and the covariance 
P of the state x is given by the solution of the Lyapunov equation: 

(A + BK)P + P(A + BK) T + YC T V~^CY = 0. (10.24) 

The value of each cost index J[ defined by (10.21) is then 

J^g = Tr R t KPK T + Tr Q t (Y + P) . 

Now, taking Z — KP as a new variable, we replace (10.24) by the linear equality in W and 
P 

AP + PA T + BZ + Z T B T + YC T V~ 1 CY = 
and each cost index becomes 

J^ g =TrR i Zp- 1 Z T + TrQi(Y + P). 
The multi-criterion problem is then solved by the EVP in P, Z and X 



subject to 



>0, 



minimize Tr R X + Tr Q (P + Y) 
X Z 

Z T P 

AP + PA T +BZ + Z T B T + YC T V~ 1 CY = 0, 
TrR,X + TrQ t (P + Y)< lt , i = l,...,L 

This formulation of the multi-criterion LQG problem is the dual of our formulation; the two 
formulations can be used together in an efficient primal-dual method (see, e.g., [VB93b]). We 
also mention an algorithm devised by Zhu, Rotea and Skelton [ZRS93] to solve a similar class 
of problems. For a variation on the multi-criterion LQG problem discussed here, see [Toi84]; 
see also [TM89, Mak91]. 



Nonconvex multi-criterion quadratic problems 

This section is based on Megretsky [Meg92a, Meg92b, Meg93] and Yakubovich [Yak92]. 
In our discussion we omitted important technical details, such as constraint regularity, which 
is covered in these articles. Also, Yakubovich gives conditions under which the infimum 
is actually a minimum (that is, there exists an optimal control law that achieves the best 
performance). The EVP (10.23) is derived from Theorem 3 in Willems [Wil71b]. 

Many of the sufficient conditions for NLDIs in Chapters 5-7 can be shown to be necessary 
and sufficient conditions when we consider integral constraints on p and q, by analyzing them 
as nonconvex multi-criterion quadratic problems. This is illustrated in the following section. 



Mixed H^-Hqo problem 

We consider the analog of an NLDI with integral quadratic constraint: 



x — Ax + B p p + B u u, q — C q x, 



)l/2 

R 1/2 



X 

u 



p p dt < / q q dt 
Jo 
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Define the cost function J = J z T z dt. Our goal is to determine min^ max p J, over all 
possible linear state-feedback strategies u = Kx. (Note that this is the same as the prob- 
lem considered in §7.4.1 with integral constraints on p and q instead of pointwise in time 
constraints.) 

For fixed K, finding the maximum of J over admissible p's is done using the results of §10.9. 
This maximum is obtained by solving the EVP 



minimize ^(0) Px(0) 

(A + B U K) T P + P(A + B U K) + 
Q + K T RK + rCjC q 



subject to 



PB P 
-rl 



<0, 



r > 



The minimization over K is now simply done by introducing the new variables W — P~ l , 
Y — KW , fi — 1/t. Indeed, in this particular case, we know we can add the constraint 
P > 0. The corresponding EVP is then 



minimize x(0) T W 1 x(0) 



subject to 



WA T + YBl + AW + B U Y+ 

WQW + Y T RY + WC^CqW/fi 

fj,B p 



fiBp 
-Hi 



<o, 



H> 



which is the same as the EVP (7.23). 
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RTy k -r) mxn 
, xv , XV 

R+ 

C 

Re (a) 
S 

h 

M T 
M* 

TrM 
M > 

M > 

M > N 
M 1 ' 2 

A m ax(M) 
Amajt(j i Q) 

A mi „(M) 
l|M|| 

diag(- • •) 



The real numbers, real fc-vectors, real mxn matrices. 

The nonnegative real numbers. 

The complex numbers. 

The real part of a £ C, i.e., (a + o*)/2. 

The closure of a set S C R". 

The k x k identity matrix. The subscript is omitted when k is not 
relevant or can be determined from context. 
Transpose of a matrix M: (M T )ij = M^. 

Complex-conjugate transpose of a matrix M: (M*)ij = M* i; where 
a* denotes the complex-conjugate of a £ C. 

Trace of M £ R" XTl , i.e., ]T™ =1 M u . 

M is symmetric and positive semidefinite, i.e., M = M T and 
z T Mz > for all z £ R". 

M is symmetric and positive-definite, i.e., z T Mz > for all nonzero 
z £ R . 

M and N are symmetric and M — N > 0. 

For M > 0, M 1 / 2 is the unique Z = Z T such that Z > 0, Z 2 = M. 

The maximum eigenvalue of the matrix M = M T . 

For P = P T , Q = Q T > 0, A max (P, Q) denotes the maximum eigen- 
value of the symmetric pencil (P,Q), i.e., A max (Q _1 ' 2 P(5 _1 ' 2 ). 

The minimum eigenvalue of M = M T . 

The spectral norm of a matrix or vector M, i.e., yA max (M T M). 
Reduces to the Euclidean norm, i.e., ||x|| = vi t i, for a vector x. 

Block-diagonal matrix formed from the arguments, i.e., 

Mi 



diag(Mi,...,M m ) 



A 



M„ 
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CoS 



Ex 



Notation 



Convex hull of the set S C R n , given by 



CoS^ J> 



Xi s 5, p > 



(Without loss of generality, we can take p = n + 1 here.) 
Expected value of (the random variable) x. 
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ARE Algebraic Riccati Equation 

CP Convex Minimization Problem 

DI Differential Inclusion 

DNLDI Diagonal Norm-bound Linear Differential Inclusion 

EVP Eigenvalue Minimization Problem 

GEVP Generalized Eigenvalue Minimization Problem 

LDI Linear Differential Inclusion 

LMI Linear Matrix Inequality 

LMIP Linear Matrix Inequality Problem 

LQG Linear-Quadratic Gaussian 

LQR Linear-Quadratic Regulator 

LTI Linear Time-invariant 

NLDI Norm-bound Linear Differential Inclusion 

PLDI Polytopic Linear Differential Inclusion 

PR Positive-Real 

RMS Root-Mean-Square 
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system with multiplicative noise, 132 

system with unknown parameters, 125 
Stability margin 

LDI, 65 

mean-square, 132, 137 
Stabilizability 

LDI, 100 

positive orthant, 144 

system with multiplicative noise, 134 
Stabilization problem 

complexity, 128 
Standard LMI problems, 9 
*, 143 
State, 52 
State-feedback 

controller, 99 

global linearization, 99 

in linear system with delay, 145 

LDI, 99 

nonlinear system, 99 

positive orthant stability, 144 

scaling, 111 

synthesis, 99 

system with multiplicative noise, 134 
State properties 

LDI, 61 

Lur'e system, 120 

state-feedback, 100 

system with multiplicative noise, 131 
State-to-output properties 

filter realization, 148 

LDI, 84 

Lur'e system, 121 

state-feedback, 107 
Static state-feedback, 99 
Step 

length, 28 

response, 97 
Stochastic stability margin, 132 
Stochastic system 

continuous-time, 136 
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discrete-time, 131 
Stopping criterion 

method of centers, 17 
Stratonovitch stochastic system, 136 
Strict feasibility 

reduction to, 19 
Strict LMI, 18 
Structured 

feedback, 54, 152 

singular value, 152 
Structured uncertainty 

interpolation, 146 

matrix problem, 39 
Sturm method, 26, 35 
Symmetric polytope, 48 
Synthesis 

output-feedback, 117, 150 

state-feedback, 99, 134 
System 

realization, 148 

stochastic, 136 

uncertain, 56 

with multiplicative noise, 131 
System theory duality, 5 
System with unknown parameters 

matrix problem, 39 

multipliers for, 124 



Unknown parameters 
system with, 124 

V 

Value function, 112 
Variable 

elimination, 9 

matrix, 8 

slack, 8 
Vertex 

of polytope, 41 
vol, 13 
Volume 

ellipsoid algorithm, 13 

invariant ellipsoid, 69 

of ellipsoid, 41 

w 

Well-posedness 

DNLDI, 54, 57 

NLDI, 53, 63 
White noise 

Brownian motion, 137 

uncertainty, 131 



Youla parametrization, 152 



Tangential Nevanlinna-Pick interpolation, 

145 
Time- varying 

feedback matrix, 53 

system, 51 
Tr, 5 
Trace, 5 
Transfer matrix, 25 

LTI system, 64, 91 

optimization, 141 

positive-real, 25 
Transformation 

loop, 119 
Tsypkin criterion, 2 

u 

Uncertainty 

LTI, 143 

parametric, 124 

random, 131 

time-varying, 51 
Unit-energy input 

LDI, 77 

Lur'e system, 121 

state-feedback, 104 

system with multiplicative noise, 132 

system with unknown parameters, 125 
Unit-intensity white noise, 138 
Unit-peak input, 97 

LDI, 82 

state-feedback, 107 
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